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Abstract. We consider the cubic defocusing nonlinear Schrodinger equation on the two dimensional torus. We 
exhibit smooth solutions for which the support of the conserved energy moves to higher Fourier modes. This 
weakly turbulent behavior is quantified by the growth of higher Sobolev norms: given any 5<C 1,K ^ l,s > 1, 
we construct smooth initial data ug with ||ito||Hs < so that the corresponding time evolution u satisfies 
||u(r)||£fs > K a.t some time T. This growth occurs despite the Hamiltonian's bound on and despite 

the conservation of the quantity ||tt(t)|| . 

The proof contains two arguments which may be of interest beyond the particular result described above. 
The first is a construction of the solution's frequency support that simplifies the system of ODE's describing each 
Fourier mode's evolution. The second is a construction of solutions to these simpler systems of GDE's which begin 
near one invariant manifold and ricochet from arbitrarily small neighborhoods of an arbitrarily large number of 
other invariant manifolds. The techniques used here are related to but are distinct from those traditionally used 
to prove Arnold Diffusion in perturbations of Hamiltonian systems. 
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(1.1) 



1. Introduction 

We consider the periodic defocusing cubic nonlinear Schrodinger (NLS) equation 

idtu + Au = \u\'^u 



u(0, x) :— uq{x) 



where u{t,x) is a complex valued function with the spatial variable x lying in the torus := M^/(27rZ)^ 



Equations such as (1.1 1 arise as models in various physical settings, including the description of the envelope of 
a general dispersive wave in a weakly nonlinear medium, and more specifically in some models of surface water 
waves. (See e.g. the survey in [5^, Chapter 1.) 

We shall always take the initial data uo{x) to be smooth. Recall (see e.g. [23] [SS]) that smooth solutions to 



(1.1 1 exhibit both conservation of the Hamiltonian, 
(1.2) ^E[um, 
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and conservation of mass, or L^(T^) norm, 
(1.3) 



u\^dx{t) = / lul-'dxiO), 

for all t > 0. The local-in-time well-posededness result of Bourgain 9 for data uq G H^{T^), s > 0, together with 
these conservation laws, immediately gives the existence of a global smooth solution to from smooth initial 
data. 



We are interested in whether there exist solutions of ( 1 . 1 1 which initially oscillate only on scales comparable 



to the spatial period and eventually oscillate on arbitrarily short spatial scales. One can quantify such motion in 
terms of the growth in time of higher Sobolev norms ||ii(t)||/fs(T2), s > 1, defined using the Fourier transform by, 

(1.4) ||u(t)|U=(T2) :=||u(i,-)||H=(T^) := (j2{ny\uit,n)\] 

where {n) := (1 + |rtp)5 ancj^ 



u{t, n) 



f2 



u(i, a;)e-"-^(ia;. 



For example, together (1.2 1 and (1.3 1 give a uniform bound 



(1.5) 



< C 



on the solution's norm. Hence, for fixed s > 1, ||u(f)|j/fs(T2) could grow in time if the terms contributing 
substantially to the sum on the left hand side of (1.5) correspond, loosely speaking, to ever higher \n\. From this 
point of view then, we are interested in whether the energy of a solution to ( |1.1[ ) can be carried by higher and 
higher Fourier modes. 

The one space-dimensional analog of ( |1.1| is completely integrable [55], and the higher conservations laws 
in that case imply ||M(t) < C(||u(0)||^fs(Ti)), s > 1 for all t > Q. It is unknown (see e.g. [20]) whether 
unbounded growth in , s > 1, is possible in dimensions 2 and higher, let alone whether such growth is generic. 
The main result of this paper is the construction of solutions to ( |1.1[ ) with arbitrarily large growth in higher 
Sobolev norms. 

Theorem 1.1. Let \ < s. K ^ 1, and Q < 5 <^ \ he given parameters. Then there exists a global smooth solution 
u(t, x) to ( 1.1 1 and a time T > with 

\\u{0)\\h^ <S 

and 

\\u{T)\\h^ > K. 



Note that, in view of (1.2), (1.3), the growth constructed here must involve both movement of energy to 
higher frequencies, and movement of mass to lower frequencies. (The mass associated to the higher and higher 
frequency energy must be decreasing by energy conservation. This must be balanced, by mass conservation, by 



more and more mass at low frequencies.) Recall again that any smooth data in (1.1 1 evolves globally in time. 
While finite and infinite time blowup results are known for focusing analogs of (1.1 1 (e.g. [Ti], [TB], [4(1, [50], 



[54j ) , the mechanism responsible for the iJ" norm growth in Theorem |1.1| is distinct from these blowup dynamics 

^In what follows, we omit the factors of 2it arising in definitions of the Fourier transform and its inverse, as these play no role in 
our analysis. 
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Using the conservation laws (1.2), (1.3 1 and the Sobolev embedding theorem, we observe that, for s = 1, we 



have a stability property near zero. 



lim sup 

, \t\^oo 



sup \\u{t)\\Hi 
"oil wi <'5 



< C5. 



Theorem 1 1 . 1 1 implies a different behavior in the range s > 1. Since S may be chosen to be arbitrarily small and 
K may be chosen arbitrarily large in Theorem |1.1| we observe the following: 

Corollary 1.1 instability of zero solution). The global-in-time solution map taking the initial data uq to the 
associated solution u of ( |1.1[ ) is strongly unstable in H'^ near zero for all s > 1: 



(1.6) 



inf limsup 



5>0 



W- 



sup \\u{t)\\H- 

I "0 II ffs<<5 



It does not follow from (1.6 1, nor directly from Theorem |1.1| that there exists initial data uq g for some 
s > 1 which evolves globally in time and satisfies limsup|j|_,o2 ||u(i)||i/s = oo. As remarked above, this remains an 
interesting open question [20]. In Section|5]below, we do prove that ( |1.1[ ) has no nontrivial solutions which scatter 
-i.e. which approach a solution to the linear equation at time t = oo. Note that Theorem |1.1| and Corollary [1.1 1 
also hold true for higher dimensional generalizations of (1.1) by considering solutions which are only dependent 
upon two of the spatial coordinates. 

Theorem 1 1.1 1 is motivated by a diverse body of literature which we briefly review now, including upper bounds 
on the possible growth of Sobolev norms for ( |1.1[ ), lower bounds for related models, and extensive work on the 
so-called weak-turbulence theory of related wave models. 

A straightforward iteration argument based on the local theory [3] shows that high Sobolev norms of solutions 
of (1.1 1 can grow no faster than exponential-in-time. Bourgain used refinements |15j of the Strichartz inequality 
to prove polynomial-in-time upper bounds |12j on Sobolev norm growth. These results were sharpened, using a 
normal forms reduction, in [13^. (See also [25], [55].) 



are, to the best of our knowledge, 
[46]). where the following small 



Previous examples of growth in the higher Sobolev norms of solutions to ( [1.1 
found only in the work of Kuksin [43] (see also related work in 142] , [44] , 145 
dispersion nonlinear Schrodinger equation is considerecQ 

(1.7) -idtW + 6/^w^ 

with odd, periodic boundary conditions, where 5 is taken to be a small parameter. Kuksin shows, among other 



\w\'^w 



results and in various formulations, that smooth norms of solutions of (1.7 1 evolving from relatively generic data 



with unit norm eventually grow larger than a negative power of 5. This result can be compared with Theorem 
as follows. Suppose w is a solution of (1.7l. The rescaled function us{t,x) = S^'sw{d~^t,x) satisfies the 
PDE (1.1), the same equation as (1.7) with 6 = 1. Note that \\D^us{0, = (5^2 ||£)Ji(;(0, •)||^2 . Hence, in 



the context of ( 1 . 1 1 , the above described result in [43^ addresses growth of solutions emerging from sufficiently 
large initial data, where the size of the data depends on the amount of growth desired. In contrast. Theorem [1.1| 
concerns growth from small, or order one, dataj^and its proof involves a strong interplay between the equation's 
dispersion and nonlinearity. 



^In these papers, Kuksin considers a variety of deterministic and random nonlinear Schrodinger equations in addition to ( |1.7[ l. In 
|45| the analog of | |1.7| f or the second order wave equation is considered. Note too that we write (1.7) with the sign convention on the 
time variable as in rather than the convention used in |43| . but this makes no substantial difference in describing the results. 

•^Theorem 1 1.1 1 is stated for the case 5 <g; 1. However as we hope is clear from the discussion in section 2.6 below, the construction 
allows any choice of 5 > 0. 
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In contrast to the question for the nonhnear problem ( |1.1[ ) , there are several results on the growth of Sobolev 
norms in linear Schrodinger equations with potentials, 

(1.8) -idtv + Av + V{t,x)v^O, xeT^. 

For example, for V smooth in space, and random in time (resp. smooth and periodic in x and t), Bourgain proved 
that generic data grows polynomially f21J (resp. logarithmically [17]) in time. 

Bourgain has constructed jTU], [IT] Hamiltonian PDEs with solutions with divergent high Sobolev norms. 
These constructions are based on perturbation arguments off linear equations with spectrally defined Laplacian 
and also involve somewhat artificiaj^ choices of nonlinearities. In [T?, solutions with divergent Sobolev norms 
are constructed for a wave equation with a natural cubic nonlinearity but still involving a spectrally defined 
Laplacian. 



Bourgain has also shown [TS], [T5] that there is an abundance of time quasi-periodic solutions of (1.1 1 



As mentioned at the outset, the growth of higher Sobolev norms is just one way to quantify the diffusion 
of energy to higher and higher modes - also called a forward cascade, or a direct cascade. Furthur motivation 
for Theorem |1.1[ and indeed for much of the work cited above, is the literature of analysis, physics, numerics, 
heuristics and conjectures regarding this phenomenon in related wave models. For example, since the early 
1960's a so-called weak turbulence theory (alternatively wave turbulence theory) has been developed which gives 
a statistical description of the forward cascade in various "weakly interacting" dispersive wave models, mainly 
based on the analysis of associated kinetic equations for the evolution of the Fourier modes [31 IH [37l Hll [61] . 
There are a great many subsequent works both within and outside the framework of weak turbulence theory 
which address the passage of energy to higher modes in dispersive wave models. We are not prepared to give here 
a representative survey, but see [311 (311 HIl UHl [S3] and references therein for examples. 

In broadest outline, the proof of Theorem 1 1 . 1 1 proceeds by first viewing ( |1.1[ ) as an infinite dimensional system 
of O.D.E.'s in {a„(i)}„gz2, where a„(t) is closely related to the Fourier mode u{t,n) of the solution. We identify 
a related system, which we call the resonant system, that we use as an approximation to the full system. The goal 
then will be to build a solution {r„(t)}„gz2 to the resonant system which grows in time. This is accomplished 
by choosing the initial data {fn(0)}„gz2 to be supported on a certain frequency set A C Z'^ in such a way that 
the resonant system of O.D.E.'s collapses to an even simpler, finite dimensional system that we call the Toy 
Model System, and whose solution we denote by b{t) = {bi{t) , b2{t) , . . . , 6jv(i)}. Each variable bi{t) will represent 
how a certain subset of the {r„(t)}„g22 evolves in time. There are two independent but related ingredients 
which complete the proof of the main Theorem. First, we show the existence of the frequency set A, which is 
defined in terms of the desired Sobolev norm growth and according to a wish-list of geometric and combinatorial 
properties aimed at simplifying the resonant system. Second, we show that the Toy Model System exhibits 
unstable orbits that travel from an arbitrarily small neighborhood of one invariant manifold to near a distant 
invariant manifold. It is this instability which is ultimately responsible for the support of the solution's energy 
moving to higher frequencies. Instabilities like this have been remarked on at least as far back as [5S], but 
have been studied with increasing interest since the paper of Arnold [5] . Our construction has similarities with 
previous work on so-called "Arnold Diffusion" and more general instabilities in Hamiltonian systems (see e.g. 
[ll[2l[5l[6l[Il[8l[Ml[25l[2ll[Ml[29l[30l[3ll[Ml However, the particular approach taken 

here seems to be different than arguments presently in the literature and might be of independent interest. There 
are several different definitions of "Arnold Diffusion" (see e.g. the remarks in the introduction of the survey by 
Delshams, Gidea, and de la Llave in [IH])- Our use of this term for the orbits described above, which travel close 
to a sequence of invariant manifolds and lead to a change in the action variable of order 1, is in line with some, 

^See Remark 2 on p. 303 of .12. for further discussion. 
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but not all of these conventions. We emphasize also that "diffusion" is used here as in much of the literature, in 
a nontechnical - even colloquial - sense following [5]. 

The paper is organized as follows. Section [2] provides a more detailed overview of the argument, giving the 
proof of Theorem 1 1 . 1 1 modulo some intermediate claims. Section [3] contains a proof that the Toy Model System 
exhibits the unstable orbits described above. Section 4| co nstructs the frequency set A. Finally, we prove as a 
postscript in Section |5] that no nontrivial solutions to (1.1 1 scatter to linear solutions. 



2. Overview and Proof of Main Theorem 



2.1. Preliminary Reductions: NLS as an Infinite System of ODE's. The equation (1.1) has gauge free- 
dom: upon writing 

(2.1) v{t,x) = e''^*'u{t,x), G e M {constant), 



the NLS equation (1.1 1 becomes the following equation for w, 

(2.2) {~idt + ^)v={G+\v\')v 

with the same initial data. (We will soon choose the constant G to achieve a cancellation.) 

We make the Fourier Ansatz, motivated by the explicit solution of the linear problem associated to NLS. For 



solutions of the NLSq equation (2.2 1, we write 

(2.3) v{t,x) = a«(i)e''"-"+l"l'*^ 



We consider in this paper only smooth solutions, so the series on the right of (2.3 1 is absolutely summable. 
Substituting (2.3 1 into the equation ( |1.1[ ) and equating Fourier coefficients for both sides gives the following 
infinite system of equations for a„ (t) , 



ni,n2,n3GZ 
"1— "2+n3=" 



(2.4) 

where 
(2.5) 

Certain terms on the right hand side of (2.4 1 can be removed by correctly choosing the gauge parameter G. We 



W4= |ni|'-|n2|' + |n3|'-|n|' 



describe this cancellation this now. Split the sum on the right hand side of (2.4) into the following terms, 
(2.6) 

^ = ^ + ^ + ^- ^:= Term I + Term II + Term III + Term IV. 

ni,n2,n3tE2^^ ni,n2,n3€:Z^ ni,n2,n3GZ^ ni,n2,n3GZ^ ni,n2,n3GZ^ 
ni—n^+n^—n ni— 712+^3— n n-i—n2+n^—n ni—n^+n^—n ni—n^+n^—n 
ni^n^^n n\—n n^—n n^—ni—n 

Term IV here is not a sum at all, but rather — a„(i)|o„(i)p. Terms II and III are single sums which by Plancherel's 

2a„(t)- ^ \a^{t)\^ = 2an{t) ■ \\u{t)\\ 
= 2a„(i)M2, 



theorem and (1.3) total 



li2(T2) 



m£2 
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where we've written M :— \\u{t)\\ ^2 ^j^y We can cancel this with the first term on the right side of (2.4| by 
choosing G = —2M in (2.1 1. Equation (2.4) takes then the following useful form, which we denote TNLS, 



(2.7) 



-an\ar, 



E 



f^ni ^712 ^713 6 



ni ,n2 ,n36r(n) 

where 

(2.8) r(n) = {(^1,^2,^3) e (Z^)^ : ni - 712 + ng = n,ni ^ ","3 7^ 

Note that at each time u is easily recoverable from v, and both functions have identical Sobolev norms. 

One can easily show that TNLS is locally well-posed in Z^(Z^), and for completeness we sketch the argument 
here. Define the trilinear operator 



by 



(2.9) (A/'(t)(a, 6, c))„ = -a„6„c„ + ^ a^^bn^^n^ 

TLi .712 ,7l3£r(n) 

With this notation, we can reexpress TNLS as —idtUn — {M(t){a, a, a))„. 
Lemma 2.1. 

(2-10) ||(7V(t)(a,6,c))„||;i(2;2) < l|a||,i(22)||6||,i(22)||c||,i(22) 



Proof. The norm of the first term in (2.9l is bounded by ||a||;oo ||6||^oo ||c||;i , which is bounded as claimed. For 



the second term, take absolute values inside the n2 n3er(n) then replace the sum by the n sum using 
the T{n) defining relation to observe the bound ( |2.10 1. □ 



Remark 2.1. From Lemma 2.1 and standard Picard iteration arguments one obtains local well-posedness in 
The local well-posedness result is valid on [0, T] with 

The equation —idta = Af{t){a,a,a) behaves roughly like the ODE dta = for the purposes of local existence 
theory. 



2.2. Resonant and Finite Dimensional Truncations of TNLS. We define the set of all resonant non-self 
interactions Tres{n) C r(n) by 

(2.11) Tres{n) = {(ni,n2,n3) e r(n) : uu^ = Imp - |n2p + jnaP - \n\^ - 0}. 

Note that (ni, n2, na) G Tres{n) precisely when (rii, ri2, ^3, n) form four corners of a nondegenerate rectangle with 
n2 and n opposing each other, and similarly for ni and n^. One way to justify this claim is to first note the 



following symmetry: the two conditions (2.8), (2.11) defining Tj-^giji) imply directly that {ni,n2,n^) € ^resi'fT') 



if and only if (ni — no,n2 — ?^0)'^■3 ~ "-o) G Tres{n — tiq) for any no e 1?. Choosing no = n, it suffices to prove 
the above geometric interpretation for T resin) in the case n — Q, and this follows immediately from the law of 
cosines. 



Heuristically, the resonant interactions dominate in (2.7) because they do not contain the e*'^** factor that 



oscillates in time. We approximate solutions of (2.7) by simply discarding the nonresonant interactions - and 
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we make this approximation rigorous with Lemma |2.3| below. For now, we simply define the resonant truncation 
RTNLS of TNLS by. 



(2.12) -iatr„ = -r„|r„|2+ ^ 



^ni ^712 • 



(ni,n2,n3)erres(n) 

Even after making the resonant approximation, we still have an infinite ODE to work with - n can range freely 
over 1? . Our strategy is to choose initial data for which the the system simplifies in several ways. 

Suppose we have some finite set of frequencies A that satisfies the following two properties: 

• (Property 1^: Initial data) The initial data r„(0) is entirely supported in A (i.e. r„(0) — whenever 
n ^ A). 

• (Property 11^^: Closure) Whenever (7^1,7^2,713,71,4) is a rectangle in 1? such that three of the corners lie 
in A, then the fourth corner must necessarily lie in A. In terms of previously defined notation, this can 
be expressed as follows, 

(2.13) {ni,n2,n-i) &Tres{ri),ni,n2,n^ ^ =^ n e A. 

Then one can show that r„(i) stays supported in A for all time. Intuitively, this is because the non-linearity 



in (2.12) cannot excite any modes outside of A if one only starts with modes inside A. 



For completeness, we include here the short argument that condition (2.13) guarantees a finite dimensional 
model if the finite set A contains the support of the initial data r(0) = {rn{0)}nez^ ^'^^ ( |2.12 1. 



Lemma 2.2. If A is a finite set satisfying Property Property IIp^ above, and r(0) 1 — > r{t) solves RJ-NLS 



(2.12) on [0,r] then for all t e [0,r] spt[r{t)] C A. 



Proof. Define, 

B{t) :^^|r„(t)p. 

ri0A 

Thus -B(O) = 0, and from the closure property and the boundednes^ of r„(t) we get the diff'erential inequality 

\B'{t)\<C\B{t)\. 

Thus, by the Gronwall estimate, B{t) = for all t. Hence, none of the modes outside of A are excited. □ 



Our strategy thus far is to choose initial data for (1.1 1 with Fourier support in such a set A - so that the 
resonant system (2.12) reduces to a finite dimensional system. We now place more conditions on the set A and 
on the initial data which bring about further simplifications. 



We demand that for some positive integer N (to be specified later), the set A splits into N disjoint generations 
A = Ai U . . . U Ajv which satisfy the properties we specify below, after first introducing necessary terminology. 
Define a nuclear family to be a rectangle (711,772,773,774) where the frequencies 711,713 (known as the "parents") 
live in a generation Aj, and the frequencies 712,714 (known as the "children") live in the next generation A^+i. 
Note that if (711, 7i2, 713, 714) is a nuclear family, then so is (711,714,713,712), (773,772,771,774), and (713,714,711,712); we 
shall call these the trivial permutations of the nuclear family. We require the following properties (in addition to 
the initial data and closure hypotheses described above): 

^The argument sketched previously for local well-posedness in of \2.7\ carries over to ||2.12| without change. 
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• (Property III^: Existence and uniqueness of spouse and children) For any 1 < j < and any ni G Aj 
there exists a unique nuclear family (711,712,^3,714) (up to trivial permutations) such that ni is a parent 
of this family. In particular each ni e Aj has a unique spouse 713 e Aj and has two unique children 
n2,n4 G Aj_|_i (up to permutation). 

• (Property IV^^: Existence and uniqueness of sibling and parents) For any I < j < N and any n2 G Aj^i 
there exists a unique nuclear family (711,712,713,714) (up to trivial permutations) such that 712 is a child 
of this family. In particular each n2 £ ^j+i has a unique sibling 774 S Aj+i and two unique parents 
77i , 773 G Aj (up to permutation) . 

• (Property V^: Nondegeneracy) The sibling of a frequency 77 is never equal to its spouse. 

• (Property Vlj^: Faithfulness) Apart from the nuclear families, A contains no other rectangles. (Indeed, 
from the Closure hypothesis, it does not even contain any right-angled triangles which are not coming 
from a nuclear family.) 



Despite the genealogical analogies, we will not assign a gender to any individual frequency (one could do so, 
but it is somewhat artificial); these are asexual families. Since every pair of parents in one generation corresponds 
to exactly one pair of children in the next, a simple counting argument now shows that each generation must 
have exactly the same number of frequencies. 



At present it is not at all clear that such a A even exists for any given N. But assuming this for the moment, 
we can simplify the equation (2.121. It now becomes 

(2.14) ~ 



idtr^it) = -|r„(t)|V„(t) + 2r„^,„,_,(i)r„^,„,_,(i)r„^^_(i) +2r„^_„,_,(t)r„^ 



where for each 77 G A 



J 7 ' ^spouse 



G Aj is its spouse, 77chiid-i, "■chiid-2 G ^J^^ its two children, nsmi. 



G Aj is its 



sibling, and Tiparcnt-i, '^parent-2 G its parents. If 77 is in the last generation A^r then we omit the term 

involving spouse and children; if n is in the first generation Ai we omit the term involving siblings and parents. 
The factor "2" arises from the trivial permutations of nuclear families. 



We now simplify this ODE by making yet another assumption, this time again involving the initial data; 



• (Property VII^: Intragenerational equality) The function n 1— > r„(0) is constant on each generation Aj. 
Thus I < j < N and 77,77' G Aj imply r„(0) = r„/(0). 



It is easy to verify by another Gronwall argument that if one has intragenerational equality at time then one 
has intragenerational equality at all later times. This is basically because the frequencies from each generation 
interact with that generation and with its adjacent generations in exactly the same way (regardless of what the 
combinatorics of sibling, spouse, children, and parents are). Thus we may collapse the function 77 1-^- r„(<), which 
is currently a function on A = Ai U . . . U An, to the function j 1-^ bj{t) on {1, . . . , N}, where bj{t) ; — r„(i) 
whenever 77 G Aj . Thus we describe the evolution by using a single complex scalar for each generation. The ODE 



(2.141 now collapses to the following system that we call the Toy Model System 
(2.15) - idtbjit) = -\b,{t)\H^it) +2bj_i{tfVjit) + 2bj+iitfbj{t), 

with the conventiorj^ that bo{t) = &7v+i(i) = 0. 

There are three main ingredients in the proof of Theorem |1.1[ The first ingredient is the construction of 
the finite set A of frequencies. The second ingredient is the proof that the Toy Model System described above 



''We show rigorously later that the support of b(t) is constant in time - so that bo, bjv+i will vanish as a consequence of our choice 
of initial data. In other words, feo(i) S'lid ''JV+i(i) remain zero for all time because of the dynamics induced by the resonant system, 
and not by a possibly artificial convention introduced alongside these new variables. 



10 



J. COLLIANDER, M. KEEL, G. STAFFILANI, H. TAKAOKA, AND T. TAO 



exhibits a particular instability: we show that there exist solutions of the Toy Model which thread through small 
neighborhoods of an arbitrary number of distinct invariant tori. More specifically, we show that we have a "multi- 
hop" solution to this ODE in which the mass is initially concentrated at 63 but eventually ends up at In 
terms of the resonant system (2.121, this instability corresponds to the growth of higher Sobolev norms. The 
third ingredient is an Approximation Lemma which gives conditions under which solutions of RJ-NLS (2.12), 
and hence solutions corresponding to the Toy Model System, approximate an actual solution of the original NLS 
equation. When the conditions of this Approximation Lemma are satisfied, it is enough to construct a solution 
evolving according to the Toy Model which exhibits the desired growth in iJ*. A scaling argument shows that 
these conditions can indeed be satisfied, and glues the three ingredients together to complete the proof. We now 
detail the claims of the three ingredients and prove Theorem 1 1 . 1 1 modulo these intermediate claims. 



2.3. First Ingredient: The Frequency Set A. The approximation to (2.7 1 that we ultimately study is the 
time evolution of very particular data under the equation with the nonresonant part of the nonlinearity removed. 
The very particular initial data u(0) that we construct has Fourier support on a set A C which satisfies one 
more important condition in addition to those described above. 

The construction of this frequency set A is carried out in detail in Section |4] Here we record the precise claim 
we make about the set. 

Proposition 2.1 (First Ingredient: the frequency set A). Given parameters 5 1, K ^ 1, we can find an N ^ 1 
and a set of frequencies A C with, 

A = Ai U A2 U . . . U A^r disjoint union 
which satisfies Property 11^ - Property VIj^ and also, 



(2.16) 



> 



In addition, given any TZ 3> C{K, S), we can ensure that A consists of N ■ 2^ ^ disjoint frequencies n satisfying 

\n\ > n. 



We will use the term generations to describe the sets Aj that make up A. The norm explosion condition 



(2.161 describes how in a sense, generation Ayv has moved very far away from the frequency origin compared to 
generation Ai. We will use the term inner radius to denote the parameter TZ which we are free to choose as large 
as we wish. 



2.4. Second Ingredient: Diffusion in the Toy Model. The second main component of the proof is, in 
comparison with the first one, considerably more difficult to prove. Our claim is that we can construct initial 
data for the Toy Model System (2.15 1 so that at time zero, 6(0) is concentrated in its third component 63 and this 
concentration then propagates from 63 to 64, then to 65 etc. until at some later time the solution is concentrated 
aln6A'-2- We will measure the extent to which the solution is concentrated with a parameter e. More precisely. 



Note that Property 1^^, Property VII^ will be easily satisfied when we choose our initial data. 
*^One could in fact construct solutions that diffuse all the way from bi to fejv by a simple modification of the argument, but to 
avoid some (very minor) technical issues we shall only go from 63 to feiv— 2- 
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Proposition 2.2 (Second Ingredient: Diffusion in the Toy Model). Given N > l,e ^ 1, there is initial data 
b{0) = (6i(0), 62(0), . . . , 6a,(0)) e for (pls]) and there is a time T = T{N, e) so that 



1^3(0)1 >f-e |fo,(0)|<e jV3, 
\hN-2{T)\>l-e \b,{T)\<e j^N-2. 

In addition, the corresponding solution satisfies \\b(t)\\iao ^ I for all <t <T. 



This Proposition will be recast in a slightly different language in Section [3] as Theorem 3.1 



2.5. Third Ingredient: The Approximation Lemma. This ingredient concerns the extent to which we can 



approximate the system of ODE's corresponding to the full NLS equation (2.7 1 by other systems of ODE's. The 



approximate system we will ultimately study is that coming from removing the non-resonant part of the Fourier 



transform of the equation's cubic nonlinearity (i.e. equation (2.12)). Here we write the approximation lemma in 
a more general form. 

Lemma 2.3 (Third Ingredient: Approximation Lemma). Let < a < I be an absolute constant (all implicit 
constants in this lemma may depend on a). Let B ^ 1, and let T ^ B^logB. Let 

9{t) ■■= {9n{t)}neZ^ 

be a solution to the equation 

(2.17) -^^tg{t) = mt){g{t), g{t), git))) + £{t) 



for times < t < T, where Af(t) is defined in (2.9), (2.5) and where the initial data g(0) is compactly supported. 



Assume also that the solution git) and the error term £{t) obey the bounds of the form 
(2.18) \\9miH7.-)<B-^ 



(2.19) 

for allO<t<T 



£{s) ds 



We conclude that if ait) denotes the solution to TNLS (2.7 \ with initial data a(0) = 5(0), then we have 
(2.20) ||a(i)-5(t)||;.(z2)< 5-1-^/2 

for allO<t<T. 



Proof of Lemma \2.3\ First note that since a(0) = g{0) is assumed to be compactly supported, the solution a(t) 
to (2.7) exists globally in time, is smooth with respect to time, and is in 1^{1?) in space. 



Write 



if £{s) ds, and d{t) -.^ g{t) + F{t). 
Jo 



Observe that 

-idt=Af{d- F,d- F,d- F), 

where we have suppressed the explicit t dependence for brevity. Observe that g = 0;i {B~^) and F = O/i {B~^~"') 
where we use 0;i(A) to denote any quantity with an norm of 0{X). In particular we have d = 0;i(_B^^) 



By trilinearity and (2.10) we thus have 



'idt = N{d, d, d) + 0,1 {B--^-") 
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Now write a := d + e. Then we have 

-i{d + e)t = Af{d + e,d+ e,d+ e), 



which when subtracted from the previous equation gives (after more trihnearity and (2.10)) 

iet = On{B-^-") + On{B-^\\e\\i) + On{\\e\\l), 
and so by the differential form of Minkowski's inequality, we have 

dt\\e\\,<B--'-'^ + B-^e\\, + \\e\\l. 

If we assume temporarily (i.e. as a bootstrap assumption) that ||e||i = 0{B^^) for all t e [0,T], then one can 
absorb the third term on the right-hand side in the second. Gronwall's inequality then gives 

||e||i = B-^-'^ cyiY>{CB-h) 

for all t G [0, T]. Since we have T B^ log-B, we thus have ||e||i ^ B~'^~'^/'^, and so we can remove the a priori 
hypothesis ||e||i = 0{B~'^) by continuity arguments, and conclude the proof. 

□ 



Lemma 2.3 gives us an approximation on a time interval of approximate length B^ log B, a factor log B larger 
than the interval [0, B^] for which the solution is controlled by a straightforward local-in-time argument. The 
exponent cr/2 can be in fact replaced by any exponent between and a, but we choose cr/2 for concreteness. 



2.6. The Scaling Argument and the Proof of Theorem [TTT} Finally we present the relatively simple scaling 
argument that glues the three main components together to get Theorem |1.1[ 



Given (5, if, construct A as in Proposition 2.1 (This is done in Section |4] below.) Note that we are free to 
specify TZ (which measures the inner radius of the frequencies involved in A) as large as we wish and this will be 
done shortly, with TZ = TZ{S, K). 

With the number N = N{5, K) from the construction of A (recall N represents the number of generations 
in the set of frequencies), and a number e = t{K, 5) which we will specify shortly, we construct a traveling wave 
solution h{t) to the toy model concentrated at scale e according to Proposition 2.2 above. This proposition also 
gives us a time Tq = Tq{K,5) at which the wave has traversed the N generations of frequencies. Note that the 
toy model has the following scaling. 



We choose the initial data for ( |1.1[ ) by setting 
(2.21) a„(0) 



6[^^(0) for all n e Aj 



and a„(t) = when n ^ A. We specify first the scaling parameter A and then the parameter TZ according to 
the following considerations which we detail below after presenting them now in only the roughest form. The 
parameter A is chosen large enough to ensure the Approximation Lemma 2.3 applies, with c„ the solution of the 
resonant system of O.D.E.'s (2.12 1 also evolving from the data (2.21 1, over the time interval [0, A^Tq] - which is 



the time the rescaled solution fo'^-* takes to travel through all the generations in A. In other words, we want to 



apply the Approximation Lemma 2.3 with a parameter B chosen large enough so that, 
(2.22) B^logB:^ X^Tq. 
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With A and B so chosen, wc will be able to prove that ||u(i)||//s(T2) grows by a factor of ^ on [0, X Tq]. We finally 
choose TZ to ensure this quantity starts at size approximately 5, rather than a much smaller scale. 



We detail now these general remarks. The aim is to apply Lemma 2.3 with c(t) 



{c„(i)}„6Z2 defined by, 



for rt G Aj, and c„(t) = when n ^ A. Hence, we set £(t) to be the non-resonant part of the nonlinearity on the 
right hand side of (2.7 1. That is. 



(2.23) 



£{t) := - 



[r(Ti)\r,es(n)]nA3 



where is as in (2.5 1. (We include the set A in the description of the sum above to emphasize once more that the 
frequency support of c{t) is always in this set.) We choose B = C{N)X and then show that for large enough A the 
required conditions ( |2.18[ ) and ( 2.19[ ) hold true. Observe that (2.221 holds true with this choice for large enough 
A. Note first that simply by considering its support, the fact that |A| = C{N), and the fact that ||6(t)||ioo ~ 1, 
we can be sure that, ||^(t)||ji(z) ~ C'(A^) and therefore 

(2.24) ||6(^)(^)II/MZ)' l|c(i)l|;i(z^) < ^''C{N). 

Thus, ( |2.18 1 holds with the choice B — C{N)\. For the second condition ( |2.19 1, we claim 



(2.25) 



£{s) ds 



< 



Note that this is sufficient with our choices B — X ■ C{N) and T — X^Tq. It remains only to show (2.25), but 
this follows quickly from the van der Corput lemma: since a'4 does not vanish in the set T(n)\T resin), we can 
replace e*""" by ^[^— — ] and then integrate by parts. Three terms arise: the boundary terms at s = 0, T and the 



integral term involving ^[c„j(s)c„2(s)cti3(s)]. For the boundary terms, we use (2.24) to obtain an upper bound 
of C{N)X^^. For the integral term, the s derivative falls on one of the b factors. We replace this differentiated 
term using the equation to get an expression that is 5-linear in c and bounded by C{N)X^^T. 

Once A has been chosen as above, we choose TZ sufficiently large so that the initial data c(0) = a(0) has the 
right size: 



(2.26) 



KneA 



This is possible since the quantity on the left scales like A^^ in A, and TZ'' in the parameter TZ. (The issue here 
is that our choice of frequencies A only gives us a large factor (that is, ^) by which the Sobolev norm of the 
solution will grow. If our data is much smaller than 5 in size, the solution's Sobolev norm will not grow to be 
larger than K.) 

It remains to show that we can guarantee, 
(2.27) ( ^ |a„(A2ro)nrfO > X, 

\neZ2 / 



where a{t) is the evolution of the data c(0) under the full system (2.7). We do this by first establishing, 
(2.28) (j2K{X^To)\'\nA > K, 



KneA 
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and second that, 
(2.29) 



Y,\cniX^To)-a^{X'To)\M^^)<l. 



As for (2.281, consider the ratio of this norm of the resonant evolution at time X^Tq to the same norm at time 0, 



Enez^ MOW 



2rr M2u|2s 



Ef=iE„eAj&r(A^ro)| 



(>)M2rp„M2|^|2s 



Ef=iE„eAjr^(0)P|nP^ 
since c„ := when n ^ A. We use now the notation Sj :— EneA l^-P*! 



Q = 



j:li\b^\x'ToWS, 



> 



SN-2{i - e) 



eSi + eS2 + (1 - e)<5'3 + e'S'4 + • • • + sSn 

5'Ar-2(l - e) 



Sn-2 ■ [e/^ + e/^ + (1 - e)s^ + e/^ 

(1-6) 



On-2 '-'iV-2 



> 



where the last inequality is ensured by Proposition 2.1 and by choosing e < C{N,K,6) sufficiently small. 



As for the second inequality (2.291, using Approximation Lemma 2.3 we obtain that 
(2.30) 



\2s 



neA 



VneA 



1 



by possibly increasing A and TZ, maintaining (2.26). Together, the inequalities (2.301, (2.201 give us immediately 



3. Diffusion in the Toy Model 



In this section we prove Proposition |2.2| above, which claims a particular sort of instability for the system 
which we call the Toy Model System, 

(3.1) dth, = -i\h,W + 2zV,{h]^^ + fo^^i). 



This system was derived in the discussion preceding equation (2.15) above. We write hit) for the vector {bj)j^ 



and begin with some general remarks about the system (3.1 1 



Note that ( |3.1| is globally well-posed in To see local well-posedness, observe that the system is of the 

form dtb — T{b, b, b) where T : P x P x P ^ P is a. trilinear form which is bounded on P. Local well-posedness 
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then follows from the Picard existence theorem. The time of existence obtained by the Picard theorem depends 
on the P norm of b, but one quickly observes that this quantity is conserved. Indeed, we have 

dt\bj\^ = 2Rebjdtbj 

(3.2) =4Rei hj^h']_^ + b'-^b)j^^ 

and hence by telescoping series we obtain the mass conservation law dt l^jP = 0- Note that all these formal 
computations are justified if b is in l"^ , thanks to the inclusion /^(Z) C 



Though our analysis won't use it explictly, we note next that (3.1 ) also enjoys the conservation of the Hamil- 
tonian 



4 

3 



Indeed, if one rewrites H{b) algebraically in terms of b and b as 



3 



then we see from (3.1 1 that 

db db' 

and thus 

dtH{b) ^dtb-^ + dtb-^ ^0. 
^ ' db db 

Again, it is easily verified that these formal computations can be justified in P(Z). 



The system (3.1) enjoys a number of symmetries. Firstly, there is phase invariance 

b,{t)^e%{t) 

for any angle 6; this symmetry corresponds to the P conservation. There is also scaling symmetry 

bj{t) ^ Xb{XH) 

for any A > 0, time translation symmetry 

for any r g R (corresponding to Hamiltonian conservation, of course), and space translation symmetry 

bjit) ^ b,^,„it) 
for any jo € Z. Finally, there is time reversal symmetry 

6,(t)^57(-i) 

and space reflection symmetry 

bj{t)^b_j{t). 

There is also a sign symmetry 

bj{t) ^ e,b^{t) 

where for each j , ej = ±1 is an arbitrary sign. 
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Next we show that the infinite system (3.11 evolving from P data reduces to a finite system if the data is 



supported on only a finite number of modes 6^. Indeed (3.2 1 gives the crude differential inequality 



dt\b,\' = 2Re{bjdtbj) 
= 4Re(i67^(6|_ 
<4|&,f 



for any j G Z. From Gronwall's inequality we conclude first that if b is an solution to ( |3.1| , then the support 
of b is nonincreasing in time, i.e. if 6(io) is supported on / C Z at some time tg € M, then b{t) is supported in / 
for all time. The time reversal symmetry allows us to conclude that in fact the support of b is constant in time. 



In particular, if / is finite, then (3.1 1 collapses to a finite-dimensional ODE, which is obtained from (3.1) by 
setting bj = for all j ^ I. 



We conclude our general remarks here on the dynamics of (3.11 by observing how this evolution, and the 



assumptions on A that went into its derivation, account for the necessary balance in mass at high and low 
frequencies dictated by the conservation laws (1.2) and (1.3). As discussed immediately following Theorem |1.1[ 
this balance - i.e. the presence of both a forward and inverse cascade of mass - is an important complication to 



constructing any solutions to ( 1.1 ) that carry energy at higher and higher frequencies. Recall that the Toy Model 



comes from imposing a host of assumptions on the initial data for the resonant truncation system (2.12), which 



also has conserved mass, energy, and momentum. Under the assumptions that led to the Toy Model System, the 
conserved mass becomes essentialljj^ 



(3.3) 

the conserved momentum essentially becomes 
(3.4) 



E 



neAj- 



and the conserved energy now essentially becomes 

(3.5) E E I'^i') + ^ E i^^-wi' + E \bAt)\%+i{t)?- 

J neAj j j 

It looks like these three quantities are quite different. However, one observes the identities 

E E " 



and 



ngA, 



riGAj 



rieAj + i 



for all I < j < N . This simply reflects the fact that in any nuclear family (ni, n2, na, 714), one has ni + n^ = 712+^4 



and |rii| 



l"3l 



In^l . Thus the conservation of momentum follows trivially from the conservation of 



mass, and the conservation of energy is now equivalent to the conservation of the quartic expression 



^We are systematically ignoring a harmless multiplicative constant. 
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We turn now to the more specific, unstable behavior in (3.1 1 which we aim to demonstrate. Let TV > 1 be a 

be the unit-mass sphere 



fixed integer and let S C 



E {a; e 



Assigning initial data 6(to) with 6j(to) — for j < 0,j > N, (3.11 generates a group S{t) : E — > E of smooth 
flows on the smooth 2N — 1-dimensional compact manifold S, defined by S{t)b{tQ) := b{t + tg)- We shall refer to 
elements a: of E as points. 

Define the circles Ti, . . . , Tat by 

Tj := {{h, . . . , 6Ar) e E : - 1; bk - for all k ^ j} 

then it is easy to see from the above arguments that the flows S{t) leave each of the circles Tj invariant: S{t)Tj — 
Tj. Indeed for each j we have the following explicit oscillator solutions to (3.1 1 that traverse the circle T_,-: 



(3.6) 

Here 9 is an arbitrary phase. 



bk{t) for all fc ^ J. 



The main result in this section is that there is Arnold diffusion between any of these two circles, for instance 
between the thircPl circle T.s and the third-to-last circle Tj^_2- 



Theorem 3.1 (Arnold diffusion for (3.1 1). Let N > 6. Given any e > 0, there exists a point within e 0/T3 
(using the usual metric onT,), a point xm~2 within e o/Tjv-27 O'Tid a tim}^^t > such that S{t)x3 — xm~2- 



To state this result more informally, there exist solutions to (3.11 of total mass 1 which are arbitrarily con- 
centrated at the mode j = 3 at some time, and then arbitrarily concentrated at the mode j = — 2 at a later 
time. 

To motivate the theorem let us flrst observe that when A = 2 we can easily diffuse from Ti to T2. Indeed in 
this case we have the explicit "slider" solution 

(3.7) b,{t):^ b2{t):^^^^^ 

VI + e2v^* VI + e-2v^* 

where a; := e^'^*/^ is a cube root of unity. This solution approaches Ti exponentially fast as t — > —00, and 
approaches T2 exponentially fast as t ^ +00. One can translate this solution in the j parameter, and obtain 
solutions that "slide" from Tj to T^+i (or back from T^+i to T^, if we reverse time or apply a spatial reflection). 
This for instance validates the A^ = 6 case of Theorem |3.1| Intuitively, the proof of Theorem |3.1| for higher A^ 
should then proceed by concatenating these slider solutions. Of course, this does not work directly, because each 
solution requires an infinite amount of time to connect one circle to the next, but it turns out that a suitably 
perturbed or "fuzzy" version of these slider solutions can in fact be glued together. 



mentioned previously, one could in fact diffuse all the way from Ti to In by a simple modification of the argument, but to 
avoid some (very minor) notational issues near the endpoints we shall only go from T3 to Tjv— 2- 

"'^"'^We shall only flow forwards in time here. However, the time reversal symmetry bj (t) 1— > bj (—t) (or the spatial reflection symmetry 
bj{t) I— > 6jv+l— allows one to obtain analogues of all these results when one evolves backwards in time. Of course, when doing 
so, the stable and unstable modes that we describe below shall exchange roles. 
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Figure 1. Explicit oscillator solution around Ti and the slider solution from Ti to T2 



3.1. General Notation. We shall use the usual 0{) notation, but allow the constants in that notation to depend 
on N. Hence X = 0{Y) denotes X < CY for some constant C that depends on N, but is otherwise universal. 
We use X <Y as a synonym for X — 0{Y). 

We will also use schematic notation, so that an expression such as 0{fgh) will mean some linear combination 
of quantities which resemble fgh up to the presence of constants and complex conjugation of the terms. Thus for 
instance + 2fgh — ifgh qualifies to be of the form 0{fgh), and \b\^b qualifies to be of the form 0{b^). We 
will extend this notation to the case when /, g, h are vector- valued, and allow the linear combination to depend 
on N. Thus for instance if c = (ci, . . . , cat) then X^jLi^ CjCj+i would qualify to be of the form Oic^). 



3.2. Abstract Overview of Argument. 



Off the expressway, over the river, off the billboard, through the window, off the wall, nothin' but 
net. Michael Jordan [3H] 



To prove Theorem 3.1 one has to engineer initial conditions near T3 that can "hit" the target Tjv-2 (or more 
precisely a small neighborhood of Tjv-2) after some long period of time. This is difficult to do directly. Instead, 
what we shall do is create a number of intermediate "targets" between T3 and Tjv-2, and show that in a certain 
sense one can hit any point on each target (to some specified accuracy) by some point on the previous target 
(allowing for some specified uncertainty in one's "aim"). These intermediate trajectories can then be chained 
together to achieve the original goal. To describe the strategy more precisely, it is useful to set up some abstract 
notation. 

Definition 3.1 (Targets). A target is a triple (M, d, R), where M is a subset of E, d is a semi-metricp^ on S, and 
i? > is a radius. We say that a point a; G E is within a target (M, d, R) if we have d{x, y) < R for some y E M. 
Given two points x,y € S, we say that x hits y, and write x t-^ y, if we have y = S{t)x for some i > 0. Given 
an initial target (Mi, di,Ri) and a final target (M2, (i2, R2), we say that (Mi, di, i?i) can cover (M2, ^2,-^2), and 
write (Ml, di, i?i)^(M2, ^2, ^2), if for every X2 € M2 there exists an xi € Mi, such that for any point yi G E 
with d{xi,yi) < i?i there exists a point y2 (z with d2{x2,y2) < R2 such that yi hits y2- 



^^A scmi-mctric is the same as a metric, except that d{x, y) is allowed to degenerate to zero even when x ^ y. The reason we 
need to deal with semi-metrics is because of the phase symmetry x i— > e^^x on S; one could quotient this out and then deal with 
nondegenerate metrics if desired. 
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Remark 3.1. One could eliminate the radius parameter R here by replacing each target {M,d,R) with the 
equivalent target {M,d/R, 1), but this seems to make the concept slightly less intuitive (it is like replacing all 
metric balls Bd{x,R) with unit balls 1) in a renormalized metric). 

Remark 3.2. The notion of covering may seem somewhat complicated (involving no less than five quantifiers!), 
but it can be summarized as follows. The assertion (Mi, rfi, i?i)^(-M2, (i2, ^2) is a guarantee that one can hit 
any point in the final target M2 that one desires - within an accuracy of R2 in the c?2 metric - by "aiming" 
at some well-chosen point in the initial target Mi and then evolving by the flow, even if one's "aim" is a little 
uncertain (by an uncertainty of i?i in the di metric) . Thus the concept of covering is simultaneously a notion of 
surjectivity (that the flowout of Mi contains M2 in some approximate sense) and a notion of stability (that small 
perturbations of the initial state lead to small perturbations in the final state) . See also Figure [2] 




Figure 2. Mi^M2. The shaded disk of radius ri around xi flows out to the shaded area near 
M2 inside a disk of radius r2 around X2 ■ 

Informally, one can think of a target (M, d, i?) as a "fuzzy" set {x + Od{R) ■ x £ M}, where Od{R) represents 
some "uncertainty" of extent R as measured in the semi-metric d. Thus, for instance, if we were working in M? 
with the usual metric d, and M was a rectangle {{xi, X2) '■ \xi\ < ri, \x2\ < 7'2} then one might describe the target 
{M, d, R) somewhat schematically as 

{{0{ri) + 0{R),0{r2) + 0{R))} 

where the first term in each coordinate represents the parameters of the set (basically, they describe the points 
that one can "aim" at), and the second term in each coordinate represents the uncertainty of the set (this describes 
the inevitable error that causes the actual location of the solution to deviate from the point that one intended to 
hit). This schematic notation may be somewhat confusing and we shall reserve it for informal discussions only. 

One of the most important features of the covering concept for us is its transitivity. 

Lemma 3.1 (Transitivity). If{Mi,di,Ri), {M2,d2, R2), {M^jds, R3) are targets such that {Mi, di, Ri)~^{M2,d2, R2) 
and (Af2,d2,i?2)^(M3,d3,i?3), then {Mi,di,Ri)^{M3,ds,Rz). 

Proof. Let X3 be any point in A/3. Since (Af2, ^2, -R2)->*(Ai3, ^3, -R3), we can find X2 € Af2 such that for every 7/2 
with c?2(a^2,y2) < R2, there exists t/3 with d^{x^,y^) < R3 such that 2/2 '-^ 2/3- Since (Afi, di, _Ri)^(Af2, (^2, ^2), 
we can find an xi G Mi such that for every yi with di{xi,yi) < we can find 2/2 with (^2(2:21 2/2) < R2 such 
that 2/1 1-^ 2/2- Putting these two together, we thus see that for every 2/1 with di(xi, 2/1) < we can find 2/3 with 
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M 




Figure 3. Mi^M2, M2^M^ imply Mi^M^. The shaded disk of radius ri centered xi flows 
out onto the double shaded portion of the disk of radius r2 centered at X2- The disk of radius r2 
centered at X2 maps into the disk of radius near M3. 



d2{xz,yz) < R3 such that j/i 
claim follows. 



y2 



j/3, which by the group properties of the flow imply that yi 



2/3, and the 
□ 



The reader is invited to see how the transitivity of the covering relation follows intuitively from the interpre- 
tation of the concept given in Remark 3^ See also figure |3] 



We can now outline the idea of the proof. For each j = 3, . . . , — 2, we will define three targets which lie 
fairly close to T^-, namely 

• An incoming target {M~ ,d~ , Rj) (located near the stable manifold of Tj), 

• A ricochet target {Mj,(fj,R°) (located very near Tj itself), and 

• An outgoing target {M^,d'f,R'^) (located near the unstable manifold of T^). 



We will then prove the covering relations 

(3.8) (Mr , dj , R.J)^{M°, d°, R°) for all 3 < j < - 2 

(3.9) (M°, d°, R^)^{M+,d+,R+) for all 3 < j < TV - 2 

(3.10) (M+, d+, i?+)^(A/r^i, dv+i, i?7+i) for all 3 < j < TV - 2 

which by many applications of Lemma |3.1| implies covering relation 

(3.11) {Ml dl Rl)^{M%_^,dl_2,R%_2)- 
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As we shall construct (Afg , rfg, _R§) to be close to T3 and {M'^_2, d%_2, R%_2) to be close to Tat. 
3.1 will follow very quickljr^ from (3.11 1. 



Theorem 



3.4 



Our exposition is structured as follows. In Section 
{M^ ,dj' , Rj'), after construction of some useful local coordinates near each circle Tj in Section 



we shall define the targets (Mr^dJ^R-), {M°, d°,R°), 



Section 3.5 see why (3.111 implies Theorem |3.1[ In Section 3.6 we establish the incoming covering estimate (3.8), 



3.3 



3 ' J' J' 
and then in 



and in Section 3.8 we establish the (very similar) outgoing covering estimate. Finally in Section 3.10 we establish 



the (comparatively easy) transitory covering estimate (3.101 and hence (3.111 



3.3. Local Coordinates near Tj. Fix i < j < N ~ 2. In this section we shall devise a useful local coordinate 
system around the circle that will clarify the dynamics near that circle, and will motivate the choice of targets 
{M-,d-,R-), (M]',dO,i?^"), {M+,d+,R+) involved. 

We will assume here that bj ^ 0; as mentioned in the introduction, this constraint is preserved by the flow. We 
shall refer to the hj mode as the primary mode, the mode hj^\ as the trailing secondary mode, the mode as 
the leading secondary mode, the modes 61, ... , 6j-2 as the trailing peripheral modes, and the modes &j+2, ■ ■ • , ^JV 
as the leading peripheral modes. In the vicinity of the circle Tj, it is the primary mode that will have by far the 



most mass and will thus dominate the evolution. From (3.11 we see that the secondary modes will be linearly 
forced by the primary mode. The peripheral modes will only be influenced by the primary mode indirectly (via its 
influence on the secondary modes) and their evolution will essentially be trivial. At this stage there is a symmetry 
between the leading and trailing modes, but later on we shall break this symmetry when trying to construct a 
solution that evolves from the 3 mode to the — 3 mode (requiring one to be far more careful with the leading 
modes than the trailing ones). 

Bearing in mind the phase rotation symmetry x 1— > xe^^ we shall select the ansatz 
(3.12) hj = re'^ bk = Ckc''^ for k^j 

where r, 9 are real and the Ck are allowed to be complex (again we assume bj 7^ 0). In other words, we are 
conjugating the secondary and peripheral modes by the phase of the primary mode. Substituting these equations 



into (3.1 ) gives 



(3.13) dtCj±i + icj±idtO = —i\cj±i\^Cj±i + 2ir'^Cj±i + 2jc|_|_2Cj±i for ± 1 = +1,-1 

(3.14) 9(61 = -r2 + 2Re(c^2_i + c2+i) 

(3.15) dtCk + ickdtO = -i\ck\^Ck + 2icj:{cl_^ + c^^^) for \j -k\>2. 

We have not stated the equation for r (the magnitude of the primary mode) explicitly, since on E one can recover 
r from the other coordinates by the conservation of mass which gives the formula 

(3.16) r^ = l-Y,\ck\'. 
In particular we have the crude estimates 



^■^Readers familiar with the original paper of Arnold [2] will see strong parallels here; the notion of one set "obstructing" another 
in that paper is analogous to the notion of "covering" here, the targets {M~ , d~ , R~ ) are analogous to "incoming whiskers" for the 
torus Tj, and the targets i^^j'^'^^^'''^) ^^'^ "outgoing whiskers". Unfortunately, it seems that we cannot use the machinery from [2] 
directly, mainly because our invariant manifolds have too small a dimension and so do not have the strong "transversality" properties 
required in [2]. This requires a certain "thickening" of these sets using the above machinery of covering of targets. 
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where we use schematic notation, thus 0{c^) denotes any quadratic combmation of the and their conjugates. 
We can substitute this into (3.151 to obtain an equation for the evolution of the peripheral modes 

(3.17) dtCk^iCk + OickC^). 



Now we turn to the secondary modes (3.13). Fix a sign ±. From (3.16), (3.141 we have 



r2 = l 



,±iP - 0(4±i); d^e =-l + |c,±i|2 + 2Re(c2±i) + 0(4,±i) 

. , Cj-i, Cj^2, . . . , Cat) and c^j_i :— (ci, . . . , Cj_2, Cj+i, . . . , c^)- Substituting this 



where we write c^j+i :— (ci, . 
into (3.13) we have 

dtCj±i = icj±i + 2icJ^ - 2i\cj±ifcj±i - 2iRe(c|_ti)cj±i - 2i|cj±ipc^+ 0(cj±ic^jj_i). 
It is then natural to diagonalize the linear component of this equation by introducing the coordinates 
(3.18) 

where as in (3.7) uj := e^'"'^!'^ . One then computes 



icjii + 2icj±i — —VSi 



LUC 



2^+ 



iW c 



and 

and thus 
(3.19) 



Re(cj±i) = --|c,±i| 



0{c 



i) 



dtc 



(1 



in(-y3< 



\/3^'c+±i) + 0(c,±ic^.±ic+±i) + 0(c,±i4j.±i). 



Taking components, we conclude 



Proposition 3.1 (Local coordinates near Tj). Let 3 < j < N — 2, and let b{t) be a solution to (3.1 1 with hj{t) ^ 0. 
Define the coordinates r, 9 (primary mode), c~^^, c^^^ (secondary modes), and c* := (ci, . . . , Cj^2, Cj+2, . • . , c^r) 
(peripheral modes) by (3.121, ( 3.18[ ). Then we have the system of equations 

(3.20) dtcj^, = -VScJ^, + Oic'c-^,) + Oic+^,c%^,) 

(3.21) dtc+^, = V3c+^, + 0(c2c+±i) + 0(c-±i4,±i) 

(3.22) dtc^^ic^+0{c^c^). 



Also, the constraint bj{t) ^ Q is equivalent (via (3.16 to the condition 
(3.23) |c| < 1. 

Remark 3.3. Note the total disappearance of the primary mode coordinates r,9. From a symplectic geometry 
viewpoint, we have effectively taken the symplectic quotient of the state space with respect to the rotation 
symmetry x i— > e'^cc. The elimination of these (very large) coordinates is conducive to analyzing the evolution of 
the (much smaller) secondary and peripheral modes accurately. 



When one is near the torus Tj (which in these coordinates is just the origin c = 0), we expect the cubic terms 
0{c^) in the above proposition to be negligible. From the equations (3.20), (3.21), (3.22) we conclude in this 
regime that the two real- valued modes c~j_]^ are linearly stable, the two real- valued modes c^j^^ are linearly unstable 

(growing like e^*), and the remaining modes c* are oscillatory (behaving like e**). Observe also that most of the 
nonlinear interaction on these modes resembles a diagonal linear potential of magnitude 0{(?), indicating that 
the coupling between these modes is relatively weak (especially when 0{c^) is small); however there are some 
troublesome interactions arising from the final terms in the right-hand sides of (3.20), (3.21 ) that allow the stable 
modes to influence the unstable modes and vice versa (via coupling with other modes). This causes a certain 
amount of mixing in the evolution which will require some care to handle, and is directly responsible for the 
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rather inelegant appearance of polynomial factors of T (in addition to the more natural exponential factors) in 
the analysis of later sections. 

For the purpose of obtaining upper bounds for the magnitude of the various modes, the following useful 
lemma captures the stable nature of the c~j^i, the unstable nature of the c^j-i, and the oscillatory nature of the 
Ct, provided that the solution stays close to the circle Tj (which corresponds in these coordinates to c = 0) in a 
certain sense. 

Lemma 3.2 (Upper bounds). Suppose that [Q,t] is a time interval on which we have the smallness condition 

(3.24) f \c{s)\^ ds<l. 

Jq 

Then we have the estimates 

(3.25) |c-±i(i)| < e-^*|c-±i(0)| + f e-^'^'-'^^\c+^,{s)\\c%^,{s)\ ds 

Jo 

(3.26) |c+±i(i)| <e^*|c+±,(0)|+ f e^'^'-'%^Amc^,±ii^)\' ds 



(3.27) 
(3.28) 



|c,±i(i)|<e^*|c,±i(0)| 
|c*(i)|<|c*(0)|. 



Proof. We take absolute values in (3.20), (3.211, (3.22) and obtain the differential inequalitiej^ 



< lr-|2|p^*r 



dt\c^A<\c\'\c.A 



±l\\C^3±l\ 



i±i| 



The claims (3.25), (3.26), (3.28) now follow from Gronwall's inequality. To obtain (3.271, we take absolute values 



of (3.20), (3.21 1 and sum to obtain 



dt(\c- 



□ 



.,-±l, + |c+±ll)<(^/3 + 0(|c|2))(|c-±^ 
and the claim now follows from Gronwall's inequality again. 

Remark 3.4. The slider solution (3.7), using the j — 1 coordinates, is simply (i) = 0,cj(t) — (1 -|- e~2\/3t^-i/2 
thus escaping away from Ti using the unstable component of the 2-mode. Viewed instead in the j = 2 coordinates 
it becomes c^(i) = (1 + e2\/3t-j-i/2^ c^{t) — 0, thus collapsing into T2 using the stable component of the 1-mode 
We shall use this solution to transition from {Mj^, d^, Rj') to (AfJ^j^, dj+i, RJ+i)- 



3.4. Construction of the Targets. 

Always be nice to people on the way up, because you'll meet the same 
(Wilson Mizner) 



e on the way down. 



We now are ready to construct the targets {M~ , dj , RJ), {Mj, d^, i?°), {M'^ ,d1j , R^). As it turns out, these 
sets will lie close to Tj (and thus away from the coordinate singularity at hj = 0), and will therefore be represented 

(The modes r, are also present but will have no impact on 



using the modes c^^^, c^_-^, c^, from Proposition 



3.1 



14 



As the quantity being differentiated is only Lipschitz ratlier than smooth, these inequalities should be interpreted in the 
appropriate weak sense. 
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our computations.) Broadly speaking, these targets will demand a lot of control on the leading modes (in order 
to set up future "ricochets" off of subsequent tori "Tj+i, '^j+2; etc.) but will be rather relaxed about the trailing 
modes (as they will become small and stay small for the remainder of the evolution). 

We need a number of parameters. First, we need for technical reasons an increasing set of exponents 

for sake of concreteness, we will take these to be consecutive powers of 10 (thus — 10, A" = 10^, and so forth 
up to A%_^ = 103^-13)^ 

Next, we shall need a small parameter 

< cr < 1 

depending on N and the exponents A (actually one could take a = 1/ 100 quite safely) . This basically measures the 



distance to at which the local coordinates become effective, and the linear terms in Proposition 3.1 dominate 
the cubic terms. 

For technical reasons, we shall need a set of scale parameters 

1 « r%^2 « ^^-2 « 4-3 « ■ • • « 4 « rl 

where each parameter is assumed to be sufficiently large depending on the preceding parameters and on a and the 
A's; these parameters represent a certain shrinking of each target from the previous one (in order to guarantee 
that each target can be covered by the previous). 

Finally, we need a very large time parameter 

T> 1 

that we shall assume to be as large as necessary depending on all the previous parameters (in particular, we will 
obtain exponential gains in T that will handle all losses arising from the A, a, r parameters.) 

For each 3 < j < — 2, we also need the concatenated coordinates 

c<j_2 := (ci, . . . ,Cj_2) G C-*^^ (trailing peripheral modes) 

c<j_i := (ci, . . . ,Cj„i) G C-'^"'^ (trailing modes) 

c>j+i :— (cj+i, . . . , Cm) G C^^^ (leading modes) 

c>j+2 '■— {cj+2, ■ ■ ■ J Cm) G C^^-*^^ (leading peripheral modes) . 



Very roughly, the targets (A/^ ,dj ,Rj ), (M°,d°,i?"), (M^ ,dj' , Rj) can be defined in terms of the modes 
c by Table [l] Thus for instance, the c>j+2 mode of {M^ , dj , Rj ) ranges over arbitrary values of magni- 
tude 0{r~ e~^^^), plus an unavoidable uncertainty of magnitude 0{T^j e"^^^). As one advances from 
{Mj' ,dj ,RJ), to {M°,d°j,R°), to {M+,d+,R+), to {MJ'^^,dJ_^_^,RJ^^), and so forth, the uncertainty will in- 
crease by a polynomial factor in T (this will be where the A exponents come in) , while the size of the manifolds 
M will shrink somewhat (this will be where the r parameters come in) . 

It will take time T to flow from (M-JJ^RJ) to (M°, d°,R°), and time T from {M°, d°, i?°) to {M+,d+,R+). 
(On the other hand, we will be able to flow from {M^,d'^,R'^) to {M~+i,dJ_^_-^, RJ_^_-^) in time O(logi).) The 
reader may wish to verify that this is broadly consistent with Table [Tl using the heuristics from the previous 
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section that the stable modes Cj_j_-^ should decay by a factor of e ^ over this time, the unstable modes 
should grow by the same factor of e^'^ , and the remaining modes c<^_2, c>j_2 should simply oscillate. 

We now give more precise definitions of these objects. 

Table 1. The targets {M-^dJ,Rj), (MO,d°,i?0), {M+,d+,R+). It is the factors of e"^^ 
which are the most important feature here; the polynomial powers of T in the uncertainties, 
and the scales r in the main terms, are technical corrections which should be ignored at a first 
reading, and u should be thought of as a small constant (independent of T or the r). Note that the 
outgoing target {M^ ,d'j' , Rj') resembles a shifted version of the incoming target {M^ ,dj , Rj); 
this will be important in Section |3.10| 



Mode 


{M-,d;,R;) 


(MO,dO,i?0) 


(M+,4,i?+) 


C<j-2 


o + o(r^7e-2v^^) 




o + o(r^;e-2v^^) 










c+ 




+ O(T^°e-3^^) 




^7+1 




+ O(r^?e-3V3T) 




S+i 


0(r-e-2v^^) 






C>j+2 








6 


uncontrolled 


uncontrolled 


uncontrolled 



The Incoming target (M^ , ): 



We define the incoming target {Mj , d^ , i?^ ) by setting AI^ to be the set of all points whose coordinates obey 
the relations 

C<j-2,C+_i = 

Vi = ^ 

with uncertainty RJ := T"^j and with semimetric dj {x, x) defined by 

d-{x,x) e^^^|c<,_2 - £<.-2| + e^^lcT., - cT_ J + e4^^^|c+_, - c+.J 

where c of course denotes the coordinates of x. This metric is not defined on the set bj = (where the local 
coordinates break down), but this is not of importance to us because the metric is well defined for all points 
within (M~ ,dj , Rj), since T is so large. (If one wished, extend dj in some arbitrary fashion to be defined on 
the remaining portions of E, but this will have no impact on the argument.) 

Informally, the incoming target has a significant presence (of size roughly a) on the trailing stable mode, but 
is small elsewhere, except of course at the primary mode. 
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1 

Too 



C<.7-2 



c>, 



Figure 4. An illustration of a configuration of modes within the incoming target {M~ , dj , RJ)- 
The ilhistration ignores the tiny polynomial corrections. The diagonal lines highlight the vertical 
scale compression between the tiny exponential scales and the scale of cr ~ j^. 



The Ricochet target {M^ ,(f^,R^): 

We define the ricochet target {Mj,d^,Rj) by setting Mj* to be the set of all points whose coordinates obey 
the relations 



c<j-i,c^+i = 

|c>,+2|<r0e-'^^ 



with uncertainty i?^ := T"^^ and with semimetric dS{x,x) defined by 



Informally, the ricochet target is small everywhere outside of the primary mode, but has its largest presence 
at the trailing stable mode and the leading unstable mode, with the latter just having overtaken the former. 



The Outgoing target {M+,d+,R+): 
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1 

Too 



>1 



-2^/^iT 



C<]-2 S-1 



Figure 5. An illustration of a configuration of modes within the ricochet target (A/j*, d^, 
Note that all the modes displayed here are extremely small relative to cr ~ . 



We define the outgoing target {M^ , dj , by setting MJ*" to be the set of all points whose coordinates obey 
the relations 

4+1 = ^ 

with uncertainty = T^^ and with semimetric {x, x) defined by 



Informally, the outgoing target has a significant presence (of size a) on the leading unstable mode and is small 
at all other secondary and peripheral modes. 



3.5. Proof of Theorem 3.1 



assuming ( |3.11[ ). From (3.111, we see that there is at least one solution h{t) to 
(3.11 which starts within the ricochet target (Afg , d^,R^) at some time t^ and ends up within the ricochet target 
(M]y_2i d'jf_2, R%-2) some later time ti > to. But from the definition of these targets, we thus see that b{to) 
lies within a distance 0{r^e~^'^) of T3, while 6(ti) lies within a distance 0{rpf_2e~^'^) of Tjv_2. The claim 



follows. 



It now remains to show the covering relationships (3.8), (3.9l, (3.101. That is the purpose of the next three 
sections; the main tool shall be repeated and careful applications of Gronwall-type inequalities. 
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1 

Too 



>1 



-2^/^iT 



C<]-2 S-1 



Figure 6. An illustration of a configuration of modes within the outgoing target {M^ ,d1j ,Rj). 

3.6. Flowing from the Incoming Target to the Ricochet Target. Fix 3 < j < iV — 2. In this section 
we show that {M~ ,dj , Rj) covers {Mj,dpR^); this is the lengthiest and most delicate part of the argument. 
We shall be able to flow from {M~ ,d~ , Rj) to {Mj,d'j,R'j) for time exactly T. To do this we will of course 
need good control on the evolution of a solution starting within {M~ ,dj , RJ) for this length of time. The full 
evolution is summarized in Table [2] but to establish this behavior we shall need to proceed in stages. Firstly, we 
need upper bounds on the flow; then we bootstrap these upper bounds to more precise asymptotics; then we use 
these asymptotics to hit arbitrary locations on the flnal target (Afj*, starting from a well-chosen location 

on the initial target {M~ ,d~ , Rj). This basic strategy will also be employed in the next two sections, though 
the technical details are slightly different in each case. 

We begin with some basic upper bounds on the flow. 



Proposition 3.2 (Upper bounds, inbound leg). Let h{t) he a solution to (3.1 ) such that 6(0) is within (Af , d , i? ) 



Let c{t) denote the coordinates of b{t) as in Proposition 3.1 Then we have the hounds 



\cAt)\ 


==o(r^re-2v^^) 


S- 




= 0(ae-^*) 


c+ 




= 0(r2A- + lg-4\/3Tg\/3t^ 






= 0(r-(l + t)e-2^^e-^*) 


c+ 




= 0(r7e-2^^e^*) 



forO<t<T. 



Proof. From the hypothesis that 5(0) is within (M^ , dj ,Rj ), we easily verify that the bounds hold at time t — 0. 
To establish the bounds for later times, we can use the continuity method. Let Cq be a large constant (depending 
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Table 2. The evolution of a solution which is within the incoming target {Mj , dj , Rj ) at time 
t — 0. The a>2,aj' ,a~ are explicit numbers defined below in (3.341, (3.421. The function g{t) 
and the transfer matrices Gj^i{t), G>j+2(^) £^re explicit objects (depending only on a) that will 
be defined below. The configuration of the modes in the incoming target at t = is illustrated 
in Figure |4] Upon arrival at i = T, the modes have adjusted and appear as in Figure |5] 



Mode 


t = 


< ^ < T 


t = T 


C<j-2 


0(r^7e-2y3T) 




0(T^7e-2v^^) 






5(t) + 0(e-^^) 


0(e-^^) 


c+ 


o(r^7e-4v^^) 


0(y2A7 + lg_4V3TgV3t) 


0(2.2A- + lg-3V3T) 






0(r^7e-2N/3Tg-V3t) 




c+ 


-2V3T + 


+ 0(r^7+2e-3V3Tgy3t) 




C>j+2 


e " a>j+2 

+0(r^^- e-3V3T) 




e-2^^G>,+2(T)a>,+2 
+0(7-4.^6-3^^) 



only on iV) to be chosen later, and suppose that < T' < T is a time for which the bounds 





c*(t) 


= 0(GoT-47e-2^^) 






= 0(Goae-^*) 


4 




= 0(GoT2^7+ie-4^^e^*) 




+iWI 


= 0(Gor-(l + i)e-2^^e--^*) 


4 


+iWI 


- 0(Gor-e-2^^e^*) 



are known to hold for all < i < T'. We will then prove the same bounds without the Cq factor for the same 
range of times t, implying that the set of times T' for which the above statements hold is both open and closed; 
since that set contains 0, it must then also contain T and we are done. 



Our main tool shall of course be Lemma 3.2 From our bootstrap hypotheses (and the largeness of T) we can 
control the total magnitude of the modes: 

(3.29) \c{t)\ < GofTe-^* + CorJ e'^^^ e^'' 

for all < t < r' (the trailing stable mode is almost always dominant, except near time T where the leading 
tinstable mode begins to compete) and hence (3.241 will hold for all < < < T' (assuming a is small depending 
on Go). From (3.281 we thus have 

(3.30) |c*(t)| < |c,(0)| <r^7e- 



-2V3T 



for all < < < T' , which is the desired bound on the peripheral modes c*(i). Similarly from (3.27) we have 

(3.31) \c,^m < e^*|c,+i(0)| < r-e-^^^e^* 

which gives the desired bound on the leading unstable mode Cj'_^^. From ( |3.30 1, (3.31 1 we have 



(3.32) 



\c^j-iit)\<e 



< -2^/3T 



(T-*. +r-e 
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and hence from (3.251 we have 



< e-^*|c-_,(0)| + /*e-^(*-^)|c+_,(.)|e- 
Jo 



r-e^")2 ds. 



Since c~{0) = 0{a) and c+{s) = 0{CoT^^' +ie-'*^^e^*), we conclude (using the largeness of T) that 



(3.33) 



|c,_i(i)-| = 0(ae-^*) 



which is the desired bound on the trailing stable mode. This and (3.301 imply that 



and then from (3.25) we have 



i(0)| 



Using (3.311 to estimate |c^+2(^)li together with the initial bound |c _|_^(0)| = 0(r - e ^v^t-j obtain 



|c7+i(t)| <r-(l + t)e 



which is the desired bound on the leading stable mode. Finally, from (3.261 and (3.32) we have 



and thus from (|3.33[) and the initial bound \c+_^{0)\ = 0{T^^ e^^^^) we have 



which is the desired bound on the trailing unstable mode. 



□ 



For the rest of this section, the time variable t is assumed to lie in the range < t < T. Let us make some 
more precise hypotheses on the initial data, namely 



4+1(0) = e-'^'^a+^, + 0(T-^7e-3^^) 



A. -3V3T^ 



(3.34) 



c>,+2(0) = e-^^^a>,+2 + 0{T''^ 



A- „-3\/3T 



for some data af^i G a>j+2 G C^^-'^^ of magnitude at most rj /2 which we shall choose later. In order to 
reach the turnaround set (Mj*, d^R^), we will need slightly more precise bounds on the leading modes (and also 
the stable trailing mode, which is dominant) as follows. 

Improved control on c~_i{t). We first give a better bound on the trailing stable mode c~_j^(t), which is the 
largest of all the modes (other than the primary one, of course). Observe from Proposition |3.2| that any cubic 
term 0{c^) splits as the sum of a main term of the form 0{{cj_^)^)), plus an error of size at most 0{T^i e~^^"^) 



(say). Thus we may rewrite (3.20) somewhat crudely as 
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for some explicit cubic expression 0{{Cj_i) ) of Cj_-^. Now let g be the solutiorj [to the corresponding exact 
(scalar) equation 

dtg = -VSg + 0{g') 

with the same initial data g{0) = a. Because a is small, it is easy to establish the decay bound 

(3.35) 9{t) = 0(ae-^*) 

(e.g. by the continuity method). Writing c~_j = g + EJ_i, the error function E~_T^{t) thus obeys the difference 
equation 

dtEJ_^ = -V3E-_, + 0{<j^e-^^*\E-_,\) + 0{\ET^,f) + 0{T^^ e-^^^) 

with initial data EJ'_-^(0) — 0. From this equation it is an easy matter (e.g. by the continuity method) to establish 
the bound 

E-^,{t) = o(T-47+ie-2v^^) 
for all < t < T. In other words we have the estimate 

(3.36) c-_,{t) - g{t) + 0(r^7+ie-2^^). 
This and Proposition 



3.2 



allow us to refine our bound for c and for 



(3.37) 

and 

(3.38) 



A, +1-2V3T-V3t 



Improved control on c>j+2- Now we control the leading peripheral modes. Inserting (3.371 into (3.17) we see 
that 

We approximate this by the corresponding linear equation 

dtu = iu + 0{ug'^) 

where u{t) E C^^^^^. This equation has a fundamental solution G>2{t) ■ C^^-'^^ -j-q^. .^yi t > 0, thus 

G>2{t)u{0) — u{t). (Again, this solution could be described explicitly since g is itself explicit, but we will not 



need to do so here). From (3.351 we have 
(3.39) 



g\t) dt = 0{l), 



and so an easy application of Gronwall's inequality shows that 

(3.40) |G>2(t)|,|G>2(t)-i| = 0(l). 

Since c>j+2(0) = e~'^^^ a>j+2 + 0{T^i e^'^^'^), we are motivated to use the ansatz 

c>j+2 — e G>2{t)a>j+2 + £'>j"+2- 
The error i?>j+2 then solves the equation 



dtE>,+2 - iE>,+2 + 0{E>,+2g^) + 0{T 



A, +1 -2\/3T 



C>i+2|) 



15n 



3.4 



up to a 



This function g can in fact be computed explicitly, in fact it is essentially the function appearing in Remark 
translation in time. However, we will not need to know the exact formula for it here; the only relevant features for us is that g 
depends only on a (and possibly N) and obeys the bound ( |3.35[ l. 
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with initial data E>j+2iO) = 0{T^^ e~^^'^). Applying the bound on c>,+2 from Proposition 



3.2 



we see that 



From Gronwall's inequality and (3.391 we conclude that 
for all < < < T, and thus 

(3.41) c>,+2(t) = e-2-^^G>2(t)a>,+2 + 0{T^7 e-^^^^). 



Improved control on Cj+i- Now we consider the two leading secondary modes c!f^i, c^_^_i simultaneously. From 



(3.201, (3.21), (3.371, (3.38), and Proposition 3.2 we have the system 



If we make the ansatz 

^7+1 = e^'^^e-^^+^it); C++, = e'^^^^ e^''~4^,{t) 
to eliminate the constant coefficient terms, then the system becomes 

with initial conditions af^i{Q) = afj^-^ + 0{T^i e"^^). Writing a^+i := ( "4+^ ) and aj+i(t) 



we can write this as 

(3.42) dta,+i{t) = A{t)d,+i{t) + 0{T^7+^e-^^^e^*); aj+i(0) = a,+i + 0{T^^ e'^^) 



«;+i(<) 



where A{t) is an explicit real 2x2 matrix (depending only on a and t) which (by (3.35)) has bounds of the form 



A{t) = a 



2 I 0(e-2V3t) o{l) 

0(e-4V3t) 0(e-2V3t) 



Unfortunately, the non-decaying coefficient 0(1) here prevents a direct application of Gronwall's inequality from 
being effective. However, because this coefficient is located in a "nilpotent" part of the matrix, we can proceed 
using the following variant of Gronwall's lemma. 

Lemma 3.3 (Gronwall-type inequality). Let x(t),y{t) be vector-valued functions obeying the differential inequal- 
ities 

\dMt)\<Se-'^'\x{t)\ + S\yit)\ + \F{t)\ 
\dty{t)\<Se-f'*\x{t)\ + Se-'''\y{t)\ + \G{t)\ 
for some a, /?, 7 > 0, some < 6 < I, all < t < T , and some forcing terms F(t), G{t). Then we have 

w)\ < (1 + t)\m\ + t\ym + f\i + ter^nms)] + t\G{s)\ ds 



mi < \xm + mi + / e-''^\F{s)\ + \g{s)\ ds 

Jo 
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for alio <t <T, where the implicit constants are allowed to depend on a^j3,^. If F — G = and 6 is sufficiently 
small depending on a,/?, 7, we also have the lower bound 

\y{t)\>l\ym-Oi\xm). 

Proof. Throughout this proof wc assume that t hcs in the range < t < T, and imphed constants can depend on 
a, /3, 7. From the equation for dtx{t) and the usual Gronwall inequahty we have 

kWI< W0)|+ f\y{s)\ + \F{s)\ds. 
Jo 

Writing Y{t) := supg<s<t |?;(s)|, we conclude that 

(3.43) \x{t)\<\x{0)\+ f \F{s)\ ds + tY{t). 

On the other hand, from the equation for dty{t) and Gronwall's inequality we have 

\ym<\ym+ f e-^^\x{s)\ + \G{s)\ds- 



inserting (3.431 we conclude 

W)\<\ym + / e-''^(|a;(0)|+ / F{s') ds') + \G{s)\ds+ I e-'''Y{s)s ds. 



By Fubini's theorem we have e ^*(|a;(0)| + F{s') ds') ds < |a;(0)| + e ^'^F{s) ds. Taking suprema in t 
we conclude that 

Yit)<\y{0)\ + \xiO)\+ / e-f''\F{s)\ + \Gis)\ds+ / e-^'Y{s) ds 
Jo Jo 



and hence by Gronwall's inequality again 



\y{t)\ < Y{t) < |y(0)| + |.t(0)| + / e-^^F{s)\ + \G{s)\ ds 



The upper bounds on x and y now follows from (3.431. 

Now suppose that F = G = and 6 is small. The triangle inequality gives us, 

\yit)\>\yiO)- f\dsyis)\ds. 



With the given bound for |(9sy(s)| and the upper bounds we just proved for y{s),x{s), we conclude, 

|y(t)| > |y(0)|- f C5e-P'\x{s)ds- f C5e~-^'\y{s)ds 
Jo Jo 

> |y(0)|- / C6{l + t){x{0) + y{0))ds, 
Jo 

and the final claim in the Lemma follows by taking 6 small enough. □ 

Let Gj+i{t) be the transfer matrix associated to A{t), i.e. Gj^i{t) is the real 2x2 matrix solving the ODE 

dtG,+i{t) - A{t)G,+,{t); G,+i(0) = id. 
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Then from the Lemma 3.3 the coefficients of Gj+i enjoy the bounds 



(3.44) G,+i(i) - I ^^^^ ^^^^ 



Since cr is small, we can also use the last part of Lemma [373| and conclude that the coefficient in the bottom right 
corner has magnitude at least 1/2. (This will be important later when we "invert" Gj+i{T).) 



Now we return to (3.421, and use the ansatz 

aj+i{t) = G.j+i{t)aj+i + Ej+i{t) 
to obtain an equation for the error Ej^i. 

and then by Lemma |3.3| again we obtain the bounds 

Ej+i{t) = 0(T-47+2e-2V3^e^*) + 0(T^7+ie"^^) = 0{T^^ +^e-^^^). 
We thus conclude that 

(3-45) ( ) = G,,.{t)a,,, + 0(T-7-^e-^-). 



3.7. Hitting the Ricochet Target. Our estimates are now sufficiently accurate to show that (M^ ,dj ,Rj ) 
can cover {M^, d^, ^'j)- Consider an arbitrary point x'j in AIj, which in coordinates would take the form 

c<j-i = cj^i = 

for some z^_^^ G M, z>j+2 G C^^-'^^ of magnitude at most r°. We need to locate a point xj in M~ , which in 
coordinates takes the form 



-4- 


1 


= 


s- 


1 


= a 




-1 


= aj+,e~ 








c+ 


-1 


- 4+1 e- 


C>j4 


-2 





2V3T 
2V3T 
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for some a^_^-^ G M, a>j+2 G ^ ^ of magnitude at most r /4 (say) to be chosen later, such that given any 



data c(0) which is within RJ = T^i of xj , thus in coordinates 



the evolution of this data after time T will lie within i?^ = T'^j of a;*^ in the d*? metric. Thus we aim to show, 



|c<,_2(r)| + e^^|cT_,(T)| + e^^^-lct ^(T)| + 



(3.46) 



To establish this, we of course apply the bounds obtained in this section. From Proposition |3.2| we have 



|c<,_2(T)HO(T^7e- 



-2V3T\. 



); c7.i(T) = 0(e-^^); cUiT) ^ OiT^^^ 



'2^i +l^-3\/3T 



and hence the contribution of the trailing modes c<j-i to (3.461 will be acceptable (recall that is ten times 

-3y3T^ 



larger than ). From (3.41 1 we have 

c>j+2{t) = e-2^^G>2(T)a>j+2 + 0{T' 



Thus if we set a>j+2 := G>2(T) ^z>j+2, then the contribution of the leading peripheral modes c>j+2 to (3.461 
is acceptable. Note that since |z>j+2| < r'^, then |a>j+2| < rj thanks to (3.401 and the construction of large 
compared to r?. 

Finally, we need to deal with the leading secondary modes. According to Proposition |3.2| the contribution of 
cj^i(^) to the left hand side of (3.461 is acceptable. As for c^^^, from (3.451 we have 



g3y3T^ 
„y3T„+ 



G,+i(T)a,+i + 0(T 



A. +2-V3T 



Now recall that the matrix Gj+i(T) has the form (3.441, with the bottom right coefficient having magnitude 
comparable to 1. Because of this, and the hypothesis that l^^^il < r^, one can easily find coefficients a~^_;^, 
of magnitude at most rj (which is large compared to r^) such that 



where the exact value of the coefficient ... is not important to us since we already bounded Cj^]^(T) above. We 
thus have 



4+i(T) 



_ „-y3T. 



l + 0(T-^7+2e-2V3T) 



and conclude that the contribution of this term to (3.461 is also acceptable. This concludes the proof that 
(M- , , rj ) covers (M° , d° , r^^ ) . 
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3.8. Flowing from the Ricochet Target to the Outgoing Target. Again wc fix 3 < j < — 2. Wc now 

show that {Mj,d'j,r'j) covers (MJ*", d^, r^). Broadly speaking, this will resemble a time-reversed version of the 
arguments in Section |3.6| though there are a number of technical differences. On the one hand, the trailing 
secondary modes are now much less important and do not require as delicate a treatment as in Section |3.6| On 
the other hand, the time reversal changes the role of a; instead of starting with a stable trailing mode at size a 
and ensuring that it decays in a controlled manner, we now are starting with an unstable leading mode Cj'^i of 

small size (like e~^^) and growing it so that it reaches a almost exactly at time T. 



Table 3. The evolution of a solution which is within the ricochet target {Mj,d'j,R'j) at time 
< = 0, and with a well-chosen (and small) leading unstable mode c^_^j^. The a>2, o-t+i are explicit 



numbers defined below in (3.491, (3.551. The function g{t) and the transfer matrix G'>j_|_2(i) are 
explicit objects (depending only on a) that will be defined below. The t — configuration of the 
modes within the ricochet target is illustrated in Figure |5] At t = T, the modes are within the 
outgoing target as illustrated in Figure [6j 



Mode 


t = 


<t<T 


t = T 


C<j-2 


0(T^"e-2V3T) 


o(r^"e-2^^) 


o(r^"e-2^^) 


Vi 


0(T^"e-^^) 


0(r^"+ie-^^e-^*) 




r+ 


0(T^"e-3^^) 


0(r^"+ie-3^^e^*) 


o(r^"+ie-2^^) 


s+1 


0(T^"e-3V3^) 




0(y2Ay+3g-4V3T) 








(T 

+ 0(r^?+2g-y3T) 


C>j+2 




e-2^^G>,+2(i)a>,+2 


e-2v^^G>,+2(T)a>,+2 

+o(r^"e-3^) 



Once again, we begin with upper bounds on the flow, though now we also need a smallness condition on the 
unstable mode c^_|_j^(0) (to stop the evolution from moving too far away from Tj by time T). 



Proposition 3.3 (Upper bounds, outbound leg). Let b{t) be a solution to (3.1 1 such that 6(0) is within {Mj, d", i?°). 

Assume the smallness condition cj, ^(0) = 0{ae^^^). 



Let c(t) denote the coordinates ofb(t) as in Proposition 
Then we have the bounds 



3.1 



|c*(i) 
|c-_i(t) 



|ct,(t) 



0{T' 



3 e 



Q(^y2A^"+3g-3\/3Tg-V3t-j 



= 0{ae 



forO<t<T. 



Proof. From the hypotheses we easily verify that the bounds hold at time t = 0. As in the proof of Proposition 
3.2 we use the continuity method. We again let Cq be a large constant (depending only on A'') to be chosen later, 
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and suppose that < T' < T is a time for which the bounds 



|c4t) 

|c-_i(t) 

ic;_i(t) 

Ic7+i(*) 



0{Coae 



are known to hold for all < t < T'. As before, it suffices to then deduce the same bounds without the Cq factor. 

From the above bounds we have 

\cit)\ = 0(CoT^"+ie-^^e-^* + Coae-^^e^*) 



which gives the bound (3.241 (taking a small compared to Cq. Lemma 3.2 now applies. From (3.281 we have 



which gives the desired control on the peripheral modes c^. From (3.27) we have 

|c,+iWI<e^*|c,+i(0)|<ae-^^e^' 



which gives the desired control on the unstable leading mode cj_^-^. Next, we apply (3.251 to obtain 

|c7_i(t)| < e-^*c7_i(0) + re^(*-^)|c+_i(s)||c(s)|2 ds 

Jo 

which after substituting the initial bound cJ_-^(0) — 0{T^o e~^'^) and the above bounds on c^_i{s), c(s) gives 



which is the desired bound in the stable trailing mode c^^^. Then, we apply (3.261 to obtain 

< e-^*c+_,(0) + re^(*-^'|c-_i(.)||c(s)p ds 
Jo 

which after substituting the initial bound c^_j(0) = 0{T'^i e^^^'^) and the above bounds for cj_i and c becomes 
which is the desired bound on the unstable trailing mode c^Li- These bounds imply in particular that 



while from (3.25) we have 



|c-+i(i)|<e^^*c-+i(0)+ / e-^(*-^)|c+ (.)||a,+i(.)|d. 



Combining these bounds and also using the initial bound c^^j^(O) = 0{T^ie ^x/st-j ^^j^^ ^t^^ bound already 
obtained for c^_|_i, we conclude 



which is the desired bound on the leading stable mode c^+j^. 



□ 
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Now we need more precise bounds on the leading modes. Here we will assume the initial data takes the form 

for some data a^^j^ e M, a>j+2 & C^"-'"^ of magnitude at most 0(1) and rj /2 respectively, which we shall 
choose later. Henceforth the time variable is restricted to the interval < t <T. 

Improved control of c^+i- .We begin by refining the control on the unstable leading mode. From Proposition 
we see that any expression of the form 0{c'^) splits as the sum of a term 0{{Cj'_^i)^), plus an error of size 



3.3 



Let us now compare this against the function g, defined as the solution of the associated equation 

dt~g = V3~g + 0{f) 

with initial data 

(3.47) ~g{T) = a, 

at time T. If a is small, then an easy continuity argument]^ backwards in time shows that 
In particular we have 

(3.48) r \m? dt = 0{l). 

Jo 

We now fix aj^^i by requiring 

(3.49) g(0)=ae-^^a++i, 
thus a^^-^ = 0(1) as required. If we then use the ansatz 

then the error Ej'_^-^ obeys the equation 

d,E+^, = V3E+^, + Om'\E,+^\) + Oi\E,^,\') + 0(T34+3e-3y3T^V3*) 

with initial dat a i?^ -^(0) — 0{T^^ e^^'^^). A simple application of the continuity method and Gronwall's 
inequality (and (3.48)) then yields the bound 

E+^,it) = 0(T^?e-2v^^e-^*) 

for all < < < T. We thus conclude that 

(3.50) c++i(i) = m + 0(T3^?+3g-2V3Tgy3t) 
which then (in conjunction with the bounds in Proposition 3.3 implies that 

(3.51) 0{c^) = 0{f) + 0{T^^°+^e-^^^). 

^^Alternatively, one could observe that g is basically the time reflection of the function g used in the preceding section. 
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Improved control of c>j+2- Wc now control the leading peripheral modes, by essentially the same argument 
used to control these modes in the previous section. From ( |3.17[ ) and (3.51 ) we have 

We approximate this by the corresponding linear equation 

dtu = iu + 0{ug^) 



where u{t) e C*^ ^ ^. This equation has a fundamental solution G>j+2{t) ■ C^^ ^ ^ 
thus G>j+2(i)w(0) = u{t). From (3.481 and Gronwall's inequality we have 



iM-j-i f-Qj, t > 0, 



(3.52) 

We use the ansatz 



|G>,+2(<)|,|G>,+2(t)-i| = 0(l). 



c>j+2{t) = e 



" ^^^G>j-).2(i)a>j+2 + £'>i+2- 



The error i?>,+2 then solves the equation 



dtE>,+2 = iE>,+2 + 0{E>j+2f) + 0{T 



2A''+2-2V3T 



\c>j+2\) 



with initial data i^>j+2(0) = OiT^'U'^^'^). Applying the bound on c>j+2 from Proposition 

a,|S>,+2| = 0(|^>,+2||.9n + 0(T3^^^e-4^^). 



3.3 



we see that 



From Gronwall's inequality and (3.481 we conclude that 



\E>,+2{t)\ = 0{T'^°+'e-'^'^) 



for all < < < T, and thus 



(3.53) 



C>j+2{t) 



_ -2V3T 



G>,+2{t)a>,+2 + OiT""^ 



)• 



3.9. Hitting the Outgoing Target. Our estimates are now sufficiently accurate to show that {Mj, d", i?°) can 
cover {M^ , rf^, Rj^)- Consider an arbitrary point in Mj' , which in coordinates would take the form 



c<j-i = c^+i = 
— a 

2^T 



4+1 = 



C>j+2 = Z>j+2 

for some 2:>j+2 G C^^^^^ of magnitude at most r^. We now specify a point Xj in Mj, which in coordinates has 
the form 



-2V3T 



C>j+2 — 0->j+2 



e 
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with cij^i = 0(1) defined in (3.491 and some a>j+2 G ^ ^ of magnitude at most r'j /2 to be chosen later (see 
(3.551 below), such that given any data c(0) which is within i?^ = T^j of x'j in the metric , thus in coordinates 

(0) = o(r^"e-2^^) 



C<j-2( 



c+_,(0) = o(r^?e-3^^) 

cj^^(0) = 0(r^?e-3^^) 
c+,^(0) = ae-^^a++i + ©(T^^g-^^^) 
c>,+2(0) = a>,+2e-^^^ + 0(r^"e-3^^) 
the evolution of this data at time T will lie within = T^i of x1j in the metric. Thus we aim to show, 



|c<,_i(T)| + e4^^|cT^i(r)| + ey^^^lc+^,(r) - a| 



\/3T| 



(3.54) 



+ e3^^|c>,+2(T)-e-2^^z>,+2|<T^ 
We argue as in the previous section. From Proposition |3 . 3| we have 



c-_i(r) = 0(T-4?+ie-2v^^) 
c+_i(r) = 0(T-4°+ie-2v^^) 



c7+i(r) = o{T 



which shows that the contribution of the trailing modes and the leading stable mode will be acceptable. From 



(3.501, (3.47) we have 



^i(r) = a + 0(T 



SO the contribution of the leading unstable mode will also be acceptable. Finally, from (3.53) we have 



so if we choose 
(3.55) 



a>j+2 = G>j+2{T) ^z>j+2 



J e 



-3V3T 



then from (3.52) we see that a>j+2 will have magnitude 0{r^) < r'j/2, and the contribution of the leading 
peripheral modes will also be acceptable. This completes the proof that (Mj',d°,r°) covers (M^ ,r^). 



3.10. Flowing from the Outgoing Target to the Next Incoming Target. Fix 3 < j < iV — 2. To conclude 
the proof of Theorem 3.1 we need to show that (AIj^ , cT^ ,r^) covers {M~^-^, d~_^_-^,r~_^-^^). This turns out to be 
significantly simpler than the previous analysis because we will only need to flow^'' for a time 0(log ^) rather 
than time 0{T). This means that we can rely on much cruder Gronwall inequality-type tools than in preceding 
sections as we do not have to be so careful about exponential or even polynomial losses in T. On the other hand, 
the analysis here is more "nonperturbative" in that we are no longer close to a circle Tj or Tj+i but instead 



^^Indeed, if we could take a as large as l/\/2 then we would not need to flow at all, and we could essentially match up the j'*'' 
outgoing target with the j + l"* incoming target. However we took advantage of the smallness of a at too many places in the above 
argument, and so we are forced to add this bridging step as well. 
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will be traversing the intermediate region connecting the two. Fortunately, we have an expliciilj solution (based 



on (3.7)) that we can follow (via standard perturbative theory) to carry us from one to the other without much 



difficulty. 



We turn to the details. Before we begin, there is an issue of which coordinate system to use: the local 
coordinates around Tj, the local coordinates around T^+i, or the global coordinates 61, . . . ,&Af. It is arguably 
more natural to use global coordinates for the transition from to T^+i, but we will continue using the local 
coordinates around Tj since we have already built a fair amount of machinery in these coordinates. Also, these 
local coordinates have already quotiented out the phase invariance x 1-^ e^^x which would otherwise have required 
a small amount of attention. 



Let us start with initial data within the outgoing target {M^ 
such an initial data takes the form 



In the local coordinates around 



c<,_i(0) - 0{T^^ e 



i(0) = 0{T' 



-4V3T 



i(0) 



c>j+2(0) = e-^'^^a>j+2 + OiT^'o e 



"3%/3T 



for some a>j+2 of magnitude at most . Let us now consider the evolution of such data for times < t < 0(log ^). 
Because the unstable leading mode c^j^i is already as large as ct, and is growing exponentially, we will no longer 



be in the perturbative regime covered by Lemma 3.2 However, we must necessarily stay within the region (3.23|; 
note that the coordinate singularity {bj = 0} cannot actually be reached via the flow, because as remarked earlier 



the support of 6 is an invariant of the flow. The bound (3.231 will still allow us to use Gronwall-type arguments 



for a time period of 0(log ^), incurring (quite tolerable) losses which are polynomial in 1/ct. (This is in contrast 
to the analysis of the previous sections, where such a crude argument would cost unacceptable factors of .) 

For the rest of this section we shall restrict the time variable t to the range < i < 10 log ^ (say) . From 
(3.171, (3.23) (and (3.19) for the trailing secondary mode) we have the crude estimates 

dtc^j+i = 0{\c^j+i\) 

so by Gronwall we have the crude bounds 

(3.56) c^,+i(i) = 0(^T-4"e-2^^). 

We will return to establish more accurate bounds on the leading peripheral modes c>j+2 shortly, but for now let 
focus attention on the leading secondary modes cjj^i, '^j+i- The stable leading mode cj^^ can be controlled by 



us 



(3.20), which by (3.56) and (3.23) becomes 



tO(1) 



From Gronwall's inequality we conclude that 
(3.57) 



Actually, as in previous sections, the analysis can proceed without knowing the explicit form of this solution, only its qualitative 
properties, namely that it propagates from Tj to Tj+i using an unstable mode of the former and a stable mode of the latter. But as 
the solution is so simple, we will take advantage of its explicitness. 
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Now we turn to the most important non-primary mode, namely the unstable leading mode c^+j^. Because we are 
no longer in the perturbative regime, the schematic equation (3.21) has a form which is a little bit too crude for 
our purposes. Instead we return to (3.19). Taking advantage of the bounds (3.57), (3.56) just obtained (as well 
as (3.23 1), we can take the c~^+i component of this equation and obtain 

Now let us define g to be the solution to the scalar ODE 
(3.58) dtg^Vi{l-\g?)g 
with initial data ^(0) = a. We can easily compute g explicitljp] as 

m = 

where the time is defined by the formula 



1 



1 



Vl + e2v^*o 



This is of course closely related to the slider solution (3.7) (see also Remark 3.4). Also observe that 

1 



ff(2io) 

and that 2to < 10 log ^ if cr is small enough. 



El 



As in previous sections, we now use the ansatz 

c^+i = .9-1 ^j+i 

where (thanks to the boundedness of both g and c!jj_-^) the error E'^j^-^ obeys the equation 
with initial data E+{Q) = 0{T^^ e^'^'^). From Gronwall we thus have 



tO(1) 



and hence 
(3.59) 



i?;+i(i) = o(^r-ie 



-V3T 



In particular this (together with (3.57), (3.56)) implies that 



(3.60) 



0{c')^0{g') + 0{- 



•0(1) 



Now we can return to improve the control on the leading peripheral modes c>j+2- From (3.60), (3.56) and (3.17) 
we have 



1 



dtc>j+2 = ic>,+2 + C'(rc>j+2) + Oi-^^T'^'i e 
As in previous sections we approximate this flow by the linear equation 

dtu = iu + 0{ug^) 



"'^^For our analysis, the only property one needs of g is that it flows from a to y'l ~ in finite time (the fact that this time is 
0(log ^) is not essential to the argument). This is immediate from an inspection of the vector field associated to the ODE l |3.58| , 
but the argument via the explicit solution is equally brief. 
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where u{t) e ^ ^. This equation has a fundamental solution G>j+2{t) '■ ^ ^ ^ ^ for all t > 0; 
from the boundedness of g and Gronwall's inequality we have 

(3.61) |G>,+2(t)UG>,+2(t)"^| < ^ 



We use the ansatz 

where the error i?>j+2 obeys an equation of the form 

1 ~2yi+ -aysT^i 



with initial data E>j+2{0) = 0(T'^^ e'^^^). Fr om Gronwall's inequality we conclude 

E^,+2{t) = OiJ^T'^t e-'^'^) 

and hence 

(3.62) c>,+2(i) = e-^^^G>,+2(t)a>,+2 + 0{J^T'^t e~'^'^). 

We specialize the above bounds to the time t = 2to < 10 log ^, and conclude that 
c<,_i(2to) = 0(^T-^^%-^^^) 



1 



-7+1(2^0) = 0(^T-.e-^^-) 
c++i(2io) = yr^+0(^r^^"e-^^) 
c>,+2(2to) = e-2N/3T^^^.^^(2to)a>,+2 + ©(^T^^? e'^^/^^) 



Using (3.161, we conclude that 



The phase 9 is unspecified, but this will not concern us. We can now move back to the global coordinates 
&i, . . . , ^AT and conclude that 



b,^,{2to) = (v/T^ + ReO(^T^7e-^^)).2e'« + 0{J^T'^t e'^^'^) 

b>,+2{2to) = e^^e-2^^G>,+2(2to)a>,+2 + Oi^T'^t e-'^'^) 
where we have inserted real parts in front of some error terms for emphasis. 

We now recast this in terms of the local coordinates around '^j+i', to avoid confusion with the local coordinates 
Tj, we shall denote these new coordinates with tildes, thus f , 9, c<j_i, cj , , cj^2: c^+2' c>j+2. Firstly, bj+i{2to) 
is certainly non-zero (it has magnitude close to Vl — cr^), and an inspection of the phase of 6j+i(2to) shows that 
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We then conclude that 



rO(l 



5T(2to) = (T + 0( 



c+(2io) = 0( 



>j+2(2to)a>j+2 + Q( o(x) 



'2A7 



)• 



From this and (3.61 1 it is an easy matter to show that {AIJ]^^, dj^-^^, RJ_^i) is covered by (M^, d^, Rj), by choosing 
a>j+2 appropriately (note that any losses arising from ( |3.61 1 will be acceptable since is assumed to be much 
larger than rj_^_^ depending on a). 



4. Construction of the Resonant Set A 



Fix N > 2. We begin by constructing an abstract combinatorial model E = Ei U . . . U Eat for the collection 
A = Ai U . . . U Atv of frequencies. Each element xq G E corresponds to a unique element riQ € A and encodes both 
how Uf) is related to other elements of A, and (at least approximately) where Uq is located in Z^. The relationship 
between E and A will be made explicit later when we construct an embedding of E into and define A to be 
this image. Whereas A lives in the frequency lattice Z^, E will live in a more abstract set (C^~^, to be precise). 

To construct E, we define the standard unit square 5 C C to be the four-element set of complex numbers 

5" = {0,1,1 + «,«}• 

We split S" = 5*1 US'2, where Si :— and S2 ■— {0,l + i}. Note that this is already an = 2 model for the A's, 

if we identify the frequency lattice with the Gaussian integers Z[z] in the usual manner (ni, 712) ^ rii +m2. To 
create E we shall essentially take a large power of this unit square example. More precisely, for any 1 < j < A^, we 
define Ej C C^"""^ to be the set of all A^— 1-tuples (zi, . . . , zat-i) such that zi, . . . , Zj„i S S2 and Zj, . . . , zjv-i G Si. 
In other words. 

Note that each Ej consists of 2^~^ elements, and they are all disjoint. We then set E = Ei U . . . U Eat; this set 
consists of N2^~^ elements. We refer to Ej as the j*'' generation of E. 

For each 1 < j < A^, we define a combinatorial nuclear family connecting generations Ej,Ej_(,i to be any 
four-element set F C Ej U Ej+i of the form 

F := {(zi, . . . ,Zj_i,w,Zj+i, . . . , zn) ■■ w e S} 

where zi, . . . , Zj_i G ^2 and Zj+i, . . . , z^r G S*!. In other words, we have 

F= {Fo,Fi,Fi+,,F,;} == {(zi,...,Zj_i)} X S X {(z^+i, . . . , zat)} 

where = (zi, . . . , Zj„i, w, Zj+i, . . . , ZAf). It is clear that is a four-element set consisting of two elements 
Fi,Fi of Ej (which we call the parents in F) and two elements -Fbi-fi+j of Ej+i (which we call the children in 
F). For each j there are 2^~^ combinatorial nucleax families connecting the generations Ej and Ej+i. One easily 
verifies the following properties: 

• (Existence and uniqueness of spouse and children) For any 1 < j < A^ and any x G Ej there exists a 
unique combinatorial nuclear family F connecting Ej to ^j+i such that a; is a parent of this family (i.e. 
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X — Fi or X — Fi). In particular each x G T,j has a unique spouse (in Sj) and two unique children (in 

• (Existence and uniqueness of sibling and parents) For any 1 < j < N and any y G S^+i there exists a 
unique combinatorial nuclear family F connecting T,j to S^+i such that y is a child of the family (i.e. 
y = Fq or y = -Fi+i). In particular each y G ^j+i has a unique sibling (in Sj+i) and two unique parents 
(in S^). 

• (Nondegeneracy) The sibling of an element x € T,j is never equal to its spouse. 



Example. If TV = 7, the point a; = (0, 1 + i,0,i,i, 1) lies in the fourth generation E4. Its spouse is (0, 1 + 
i, 0, 1, i, 1) (also in S4) and its two children are (0, 1 + i,0, 0, i, 1) and (0, 1 + i,0,l + 1) (both in S5). These 
four points form a combinatorial nuclear family connecting the generations E4 and S5. The sibling of x is 
(0,1 + 1,1 + i, i, i, 1) (also in E4, but distinct from the spouse) and its two parents are (0,1 + i, 1, i, i, 1) and 
(0, 1 + i, i, 1) (both in S3). These four points form a combinatorial nucleax family connecting the generations 
S3 and S4. Elements of Si do not have siblings or parents, and elements of S7 do not have spouses or children. 

Now we need to embed S into the frequency lattice Z-^. We shall abuse notation and identify this lattice 
with the Gaussian integers Z[i\ in the usual manner. We shall need a number of parameters: 



• (Placement of initial generation) We will need a function /i : Si ^ C which assigns to each point x in 
the first generation, a location /i (x) in the complex plane (eventually we will choose / to take values in 
the Gaussian integers). 

• (Angle of each nuclear family) For each 1 < j < N and each combinatorial nuclear family F connecting 
the generations S^ and S^+i, we need an angle 9{F) e ]R/27rZ. 

Given the placement function of the first generation, and given all the angles, we can then recursively define 
the placement function fj : Sj ^ C for later generations 2 < j < iV by the following rule: 

• If 1 < j < A'' and fj : S^ — > C has already been constructed, we define /j+i : Sj+i — > C by requiring 

1 + J6{F) 1 _ e^6i.F) 

fj+iiFi+.) = MFi) + 2 ^'^^'^ 

1 + e^^(^) 1 - e*^(^) 
fj+i{Fo) = fj{Fi) 2 ^^(^^^ 

for all combinatorial nuclear families F connecting Sj to Sj+i. 



In other words, we require /j(-Fi), /j+i(-Fi+i), /j(-Fi), /j+i(Fo) to form the four points of a rectangle in C, with 
the long diagonals intersecting at angle 9{F). 

Note that this definition is well-defined thanks to the existence and uniqueness of parents. 

A model example: Let 71 be a large integer, and let the initial placement function be 

/i(zi, . . . , ZAr_i) := TZzi . . . zat-i G {TZ, iTZ, —TZ, —iTZ} for all {zi, . . . , Zm-i) G Si. 
Set all the angles 0{F) to equal 7r/2. Then one can show inductively that 

/j(zi, . . . , ZN^i) := 7^Zl . . . ZN-i e {0, i{l+iy-^n, -{l+iy~^n, -i{l+iy-^n} for all (zi, . . . , ZN-i) € 

Note that as the generations increase, an increasing majority of the points in that generation will get mapped 
to zero by the placement function, but an increasingly small minority will get mapped to larger and larger 
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frequencies. For the final generation Sjv, there is a single element (1 + i, . . . , 1 + i) which is mapped to the 
very large frequency (1 + i)^TZ, whereas all the other 2^^^ — 1 frequencies are mapped to zero. Notice that in 
each combinatorial nuclear family F = {Fq, Fi^ Fi^i^ Fi} connecting Sj to S^+i, the two parents Fi and Fi get 
mapped to frequencies of equal magnitude (indeed fj{Fi) = ifj(Fi)), but of the two children, Fq and i^i+i, one 
of the children (the "under-achiever" Fq) gets mapped to 0, whereas the other child (the "over-achiever" Fi+i) 
gets mapped to a frequency of magnitude as large as that of its parents. Our NLS solution will distribute 
mass evenly from Fi and Fi to Fq and i^i+i, but will distribute the energy from Fi and Fi almost entirely to 
Fi^i, thus sending the energy to increasingly high frequencies. Indeed by simply counting the number of nonzero 
frequencies in each generation, one easily verifies that 

, |2. ^ 2^(^-3)+27^23 



E 

•7,G/(En. 



2) 



and 



E 

"G/(S3) 



2s _ 22s+N-3'rp2s 



and so there is a norm explosion by a factor of 2*^^ ^■''^^ ^\ 



We define / : E — *■ C to be the function formed by concatenating the individual functions fj : Y,j 
function / will be referred to as the complete placement function. 



The 



Figure I?] contains a sketch of the complete placement function's image in the case N = 5. (We emphasize 
that this function is not injective - for example every generation after the first has several 4-tuples mapped to the 
origin.) 
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Figure 7. The initial placement of frequencies in the case = 5. 



The model example turns out not to be directly usable for us because it is highly non-injective, and also 
contains a large number of unwanted rectangles. But this can be fixed by a perturbation argument: 

Theorem 4.1 (Construction of a good placement function). Let N > 2, s > I, and let TZ be a sufficiently large 
integer (depending on N). Then there exists an initial placement function /i : Si ^ C and choices of angles 9{F) 
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for each nuclear family F (and thus an associated complete placement function / : S — > with the following 
properties: 

• (Nondegeneracy) The function f is injective. 

• (Integrality) We have /(S) C Z[i]. 

• (Magnitude) We have C{N)-^n < \f{x) \ < C{N)'R for all a; G S. 

• (Closure and Faithfulness) If xi,X2,X3 g E are distinct elements of Y. are such that f{xi),f{x2),f{x3) 
form the three corners of a right-angled triangle, then xi,X2,X3 belong to a combinatorial nuclear family. 

• (Norm explosion) We have 

ne/(Siv-2) "e/(S3) 

Providing this theorem is true, we see that the frequency set A := /(E), with generations A,- := /(Sj), obey 
all the required properties described in Section [2] above (See e.g. the statement of Proposition [2!T] ) 

Proof. To prove this theorem, we begin by a number of reductions which allow us to remove several of the 
constraints required on the placement function. 

First we remove the requirement that the construction work for TZ sufficiently large, by observing that as soon 
as we obtain an example involving a single TZ — TZq, we also get examples for any integer multiple TZ — kTZo of 
TZq simply by multiplying /i (and hence /) by k. Since TZ only appears in the magnitude hypothesis, we also get 
the same claim for any TZ > TZq since any such TZ is comparable up to a factor of two to an integer multiple of TZ. 

Thus we only need to exhibit an example for a single TZ = TZ{N). But then the magnitude hypothesis would 
follow from the weaker hypothesis 

• (Non-zero) f{x) ^ for all x G E. 

and we can now drop the magnitude hypothesis. 

Next, we can weaken the integrality hypothesis /(S) C Z[i] to a rationality hypothesis /(E) C Q[i], since 
by dilating /i (and hence /) by a suitable large integer (the least common denominator of all the coefficients of 
/(S)). 

Now observe that in order for /(E) to lie in the complex rationals Q[i], it would suffice (by an easy induc- 
tion argument) for the initial placement function fi to take values in Q[i], and for all the phases 9{F) to be 
Pythagorean, i.e. e'**^''^^ G Q[i]. (A typical example of a Pythagorean phase is tan"-"^ |). So we may replace the 
rationality hypothesis by the hypothesis that the initial placement function is complex-rational and all the phases 
are Pythagorean. 

However, the complex rationals Q[i] are dense in C, and the Pythagorean phases are dense in M/27rZ (indeed 
they form an infinite subgroup of this unit circle) . Thus by a perturbation argument we may dispense with these 
conditions altogether. Note that the remaining conditions required are all open conditions, in that they are stable 
under sufficiently small perturbations of the parameters. Thus any non-rational solution to the above problem 
can be perturbed into a rational one. 



48 



J. COLLIANDER, M. KEEL, G. STAFFILANI, H. TAKAOKA, AND T. TAO 



We have now eliminated the integrality and magnitude hypotheses. We now work on eliminating the non-zero, 
injectivity, and closure/faithfulness hypotheses. The point is that in the parameter space (the space of initial 
placement functions /i and phases 0(F)), the set of solutions to the norm explosion property is certainly an open 
set, so it is either empty or has positive measure. So it suffices to show that the sets where each of the non-zero, 
injectivity, and closure/faithfulness fail is a measure zero set, and thus they have no impact as to whether a 
solution actually exists or not. 

Let's begin with the injectivity condition. We need to show that for each distinct x,y € T,, that we have 
f{x) ^ /(y) for almost every choice of parameters /i, 6. Without loss of generality we may take x e T,j and 
y e T,ji where j > j'. We induct on j. When j = 1 the claim is clear since in that case f{x) = fi{x) and 
f{y) = .fi{y): ^uifl since there are absolutely no constraints on /i we see that fi{x) ^ fi{y) for almost every 
choice of /i (the angles 0{F) are not relevant in this case). 

Now suppose j > 1. Since x G Sj, there is a unique combinatorial family F :~ {x, x' ,p,p'} where p.p' G 
are the parents of x and x' is the sibling of x. By induction hypothesis, we have f{p) ^ f{p') for almost every 
choice of parameters. Now note that f{x) lies on the circle with diameter f{p), f{p'), with the location on this 
circle determined by the angle 9{F). This angle 0{F) is a free parameter and will not influence the value of ,f{y), 
unless y is equal to the sibling x' of x. But in the latter case f{y) = f{x') will be diametrically opposite to f{x) 
and thus unequal to f{x) (since f{p) ^ f{p'))- In all other cases we see that for almost every choice of 9{F), f{x) 
will not be equal to f{y), simply because a randomly chosen point on a circle will almost surely be unequal to 
any fixed point. This establishes injectivity almost everywhere. 

A similar argument establishes the non-zero property almost everywhere: if x is in the first generation Si then 
it is clear that f{x) = fi{x) ^ for almost every /i, and for x in any later generation, f{x) again ranges freely 
on the circle generated by its two parents f{p), f{p') (which are distinct for almost every choice of parameters), 
determined by some angle parameter 6{F) and thus will be non-zero for almost all choices of 0{F). 

Now we establish closure and faithfulness. Suppose x G Ej^ , y e Ej^ , z G Sj^ with jx > jy > jz are distinct 
and do not all belong to the same combinatorial nuclear family. We need to show that f{x), f{y), ,f{z) do not form 
a right-angled triangle (with either f{x), f{y), or f{z) being the right angle) for almost all choice of parameters. 

We induct on j^- When = 1, we have f{x) = .fi{x), f{y) = fi{y), f{z) = fi{z) and since /i is completely 
unconstrained it is clear that these three points will not form a right-angled triangle for almost all choices of 
parameters. Now we assume inductively that > 1 and that the claim has already been proven for smaller 
values of jx- 

As before, x belongs to a family F = {x, x' ,p,p'} consisting of x, its sibling, and its parents, ranges freely on 
the circle C with diameter f{p), f{p') g("nerated by its two parents, with the position on this circle determined by 
the angle 9{F)\ recall that f{p) ^ f{p') for almost all choices of parameters. This angle 9{F) will not influence 
either of f{y) or f{z) unless y or z is equal to the sibling a;', in which case f{x') is the point diametrically opposite 
to f(x). To proceed further, we next claim that the circle C contains no elements of Ai U . . . U Aj^ other than 
f{x),f{x'),f{p),f{p'). To see this for the first jx — 1 generations Ai U . . . U Aj^_i follows from the induction 
hypothesis, because if C contained a point f{u) from one of those generations then ,f{p), .f{p'), .f{u) would be 
a right-angled triangle, contradicting the induction hypothesis for almost all choices of parameters. To see the 
claim for the last generation Aj^, we note that any point f{v) from that generation ranges on another circle C 
spanned by the parents q, q' of v; since f{q) and f{q') do not lie on C we know that C is not co-incident to C, 
and so for almost all choices of the angle parameter determining f{v) from f{q) and f{q') we know that f{v) 
does not lie on C. 
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To summarize, we know that f{x) varies freely on the circle C as determined by the angle 9{F), that f{x') is 
diametrically opposite to f{x), that f{p), f{p') are also diametrically opposite points on C, and all other points 
in Ai U . . . U Aj^ lie outside of C and are not affected by 0{F) (for almost all choices of parameters). Since 
{a;, y,z} (t F — {x, x' ,p,p'} by hypothesis, this shows that for almost all choices of 9{F), f{x), f{y), f{z) do not 
form a right-angled triangle. 

From the preceding discussion we see that all we need to do now is establish a single example of parameters 
which exhibits the norm explosion property. But this was already achieved by the model example, and we are 
done. 

□ 



5. Appendix: Cubic NLS on has No Asymptotically Linear Solutions 



Consider solutions u : M x ^ C to the defocusing periodic NLS 
(5.1) — iut + Au — \u\'^u 

on the torus T^. We do not expect such solutions to scatter to a free solution e~'*^U4 



Indeed, the explicit 



solutions to (5.1 ) 
(5.2) 



i{t,x) = ^e*''e™-^e*l"l'*e*l^l'* 



for A > 0, K e M/27rZ, n e do not converge to a free solution, due to the presence of the phase rotation 
gi\A\ t caused by the nonlinearity. One may still hope that a solution to (5.11 scatters "modulo phase rotation". 



However, the following result shows that this is only the case for the explicit solution (5.2) mentioned above 



Theorem 5.1 (No non-trivial scattering modulo phase rotation). Let w : M x - 
which scatters modulo phases in in the sense that there exists u+ G H^iT"^) and a function 9 
such that 

||u(t)-e'^(')e-'*^ii+||ffi(M.)^0 
as t +0O. Then u is of the form (5.2 1 for some A>0, k G M/27rZ, n G Z'^. 



be an solution to ( |5.1[ | 
^ M/27rZ 



To prove this theorem, we first need some harmonic analysis lemmas. 



Lemma 5.1 (Compactness). For any G H^iT"^), the set {e*^e 
totally hounded) in H^. 



vrZji G M} is precompact (i.e. 



Proof. By taking Fourier transforms, we see that it suffices to show that the set {e*^e**I I^M^ : 9 G M/27rZ, i G M} 
is precompact in P{{n)'^ dn). But by monotone convergence, for every e > there exists i? > such that 

^ (n)2|5;(n)p<e 

neX^:\n\>R 

from which we easily conclude that the set {e*^e**l'I^CjI : 9 G M/27rZ,< G M} is covered by finitely many balls of 
radius 0(e), and the claim follows. □ 

Lemma 5.2 (Diamagnetic inequality). If u G i?^(T^), then \u\ G H^iT'^) and || |m| ||//i(t2) < |lu||/fi(T2) . 
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Proof. By a limiting argument (and Fatou's lemma) it sufRces to verify this when u is smooth. Observe that for 

any e > we have 

2|v/£2 + |u|2Vv/e2 + |u|2| = |V(£2 + \u\^)\ 

= 2|Re(uVM)| 
< 2|u||Vm| 

and hence 

|Vv/e2 + |w|2| < |Vu|. 

(This can also be seen by observing that the map u t-^ ^ + |up is a contraction.) Taking distributional limits 
as £ ^ we obtain the claim. □ 

Lemma 5.3 {H^ has no step functions). Let u e -^^(T^) he such that u{x) takes on at most two values. Then u 
is constant. 



Proof. Without loss of generality we may assume that u takes on the values and 1 only, thus = u. Differen- 
tiating this we obtain 2uWu = Vm, thus (1 — 2u)Wu = 0. But since = u, (1 — 2u)^ = 1, and so Vu = 0, and so 
u is constant as required. (Note that all these computations can be justified in a distributional sense via Sobolev 
embedding since u lies in both and L°°, and thus lies in LP{T'^) for every 1 < p < oo.) □ 



Proof of Theorem \5.1\ Let u,u^,6 be as above. We may of course assume that u has non-zero mass. From 
Lemma 5.1 (and the continuity in of the solution t i-^ u{t)) we see that the set {u{t) : < t < +00} is 

precompact in H\ Thus we can find a sequence tm —> 00 such that u(t„i) converges in to some limit vq. 

and passing to a subsequence, we can also assume that e~'^*'"^^u+ converges in to a limit 



5.1 



Applying Lemma 

v^. Since u has non-zero mass, we see on taking limits that also has non-zero mass. 

Now let : M X ^ C be the time-translated solutions v^'^^t) u{t + i,„)> thus i;(™)(0) converges in 
to Vq. Let w : M X ^ C be the solution to NLS with initial data v{0) = vq. By the local well-posedness 

theory we conclude that converges uniformly in to v on every compact time interval [— T, T]. On the 

other hand, by hypothesis, for every t we have 

as m ^ 00. Since e~**™'^M+ converges to w+, we conclude (since e^**^ is unitary on H^) that 

||„(™)(t) _ e*«(*+*")e-'*^v+||jji 

On taking limits, we conclude that 



(5.3) 



v{t,x)^e"'^'^e-'*^v+{x) 



for some a{t) G ]R/27rZ. In particular, since v+ has non-zero mass, we have the identity 

1 



L2 JT^ 



v(t)e dx. 



From (5.11 and Sobolev embedding we see that v{t) is continuously differentiable in H ^(T^), and so from the 
above identity we see that a is continuously differentiable in time. 



Now we apply {—idt + A) to both sides of (5.3 1. Using the NLS equation, we conclude that 

\vit,x)fv{t,x) = a'(t)e'"(*)e-'*^v+(x) 



WEAKLY TURBULENT SOLUTIONS FOR NLS 



51 



and thus by (5.3) we have 



\v{t,x)\'^ = a'{t) 



whenever v(t,x) ^ 0. Thus we see that for each time t, |w(i,a;)| takes on at most two values. By Lemma 5.2 and 
Lemma 5.3 we conclude that |u(t, a;)| is constant in x; by (5.3), we see that the same is true for |e~**'^u+|. By 
mass conservation we conclude that |e~'*'^u+| is also constant in time. Since has non-zero mass, we can thus 
write 

for some phase : M x ^ M./2'kZ and some constant amplitude A > 0. Since v+ was in H^, we see that ui is 
in also. Applying {—idt + A) to both sides, we conclude that 

= Ae-'"^{LUt + iAuj - iVwp) 

in the sense of distributions. Since A is non-zero, we conclude that 

LUt + iAuj - |Vcjp = 0. 

Taking imaginary parts we conclude that Alu = 0, and in particular at time < = is a harmonic map from to 
]R/27rZ. Using monodromy we can lift cj to a harmonic function from to M of at most linear growth, and thus 
(by Liouville's theorem) oj must in fact be linear. Descending back to T'^, we conclude that uj{0,x) = n ■ x + f3 
for some n and /3 G M/27rZ. Thus we have 



Since e^*'"^u^ converges to v+, e 
we have 



v+{x) = Ae^/'e*"-^. 

-itm^y^ converges to uj^. But e~'*'"'^w+ is a multiple of Ae*"'^ by a phase; thus 
u+{x) = Ae*'^e"-=". 



Applying phase rotation and Galilean symmetry (noting that the conclusion of the theorem is invariant under 
these symmetries) we may assume 7 = n = 0. Thus = A, and we have 

\\u{t) - e*^(*)e""'^^||^i(T2) ^ as i -> 00. 

From mass and energy conservation we conclude 

/ \u{t,x)\^ dx = A^\T''\ 

and 



l^^^\Vu{t,x)\' + l\u{t,x)\^ dx^^A^lT^ 



On the other hand, from Holder's inequality we have 



\\u{t,xr dx > -1^( / \u{t,x){' dxY = ^-A^\t\ 



Thus we must have 



/ \\Vu{t,x)\'' dx^O, 

thus u is constant in space, and thus is of the form u{t,x) = Ae*'''*-'. Applying (5.1) we see that u{t,x) 



^g»K(o)gi|^| t g^^^ claim follows. 



□ 
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